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TITLE OF THE INVENTION: Secreted soluble a 7 5-2, a?S-3 or a?5-4 calcium 
channel subunit polypeptides and screening assays using same 

Background of the invention 



CO 



CO 
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Voltage-dependent Ca 2+ channels (VDCCs) are heteromultimeric complexes present in both w 
neuronal and non-neuronal tissues, including heart and skeletal muscle. VDCCs are d 
minimally composed of three subunits: a pore-forming transmembrane cti subunit, a 
10 hydrophilic intracellular P subunit, and a membrane-associated cx.28 subunit; a 
transmembrane y subunit is also found in skeletal muscle tissue. Multiple subtypes and/or 
splice variants of the cti, P, and ct25 subunits have been found. S 

W 

Gabapentin ((l-aminomethyl)cyclohexane acetic acid or Neurontin) is a structural analogue 
15 of GAB A, which is mainly used as an adjunctive therapy for epilepsy. Recent research 
suggests that gabapentin may also have clinical utility for various indications including 
anxiety and pain. Although designed as a lipophilic GABA-mimetic, gabapentin does not 
have a high affinity for either GABA A or GABA B receptors, GABA uptake sites, or the 
GABA-degrading enzyme GABA-transaminase (EC 2.6.1.19). 

20 

A novel high affinity binding site for [ 3 H]gabapentin in rat, mouse, and porcin brains has 
been characterized. Recently, the [ 3 H]gabapentin-binding protein was isolated from pig 
brain and identified as the 0126- 1 subunit of VDCCs. None of the prototypic anticonvulsant 
drugs displace [ 3 H]gabapentin binding from the 0:28- 1 subunit. [ 3 H]Gabapentin-binding is 
25 stereospecifically inhibited by two enantiomers of 3-isobutyl GABA. The rank order of 
potency of gabapentin, and S- and R-isobutyl GABA, at the [ 3 H]gabapentin binding site 
mirrors their anticonvulsant activity in mice. However, electrophysiological studies have 
yielded conflicting data on the action of gabapentin at VDCCs. 



30 The a.28 subunit is derived from a single gene, the product of which is extensively post- 
translationally modified particularly through the cleavage of the signal sequence. The 
polypeptide is cleaved to form disulfide-bridged 0C2 and 5 peptides, both of which are 
heavily glycosylated. Although it seems clear today that the a.2 and 8 peptides are 
membrane-associated peptides, it is unclear whether these peptides comprise one or several 

35 transmembrane domains. Furthermore, the location, size and structural configuration of 
these eventual transmembrane domains remains to be determined. 



2 

But in any event, the fact that 0C2S is a membrane-associated protein, regardless of its 
precise structural configuration, renders its large scale expression in recombinant systems 
difficult. Indeed, as the 0.28 protein is targeted to the membrane, it requires detergent 
solubilisation to release it for purification. This important drawback imposes considerable 
5 restrictions for any potential applications requiring large amounts of recombinant protein. 
Furthermore, the various subtypes of 0:26 subunits are different proteins with very low 
homologies. It is therefore extremely difficult to predict their respective behaviors, for 
example in gene truncation experiments. 

10 The only assay currently available for the screening of ligands that bind the 0:26 subunit 
involves the use of pig membrane extracts as a source of the 0:28 subunit. Such an assay 
presents major inconvenients. Firstly, bcause the assay material is a membrane extract, it is 
very difficult to accurately determine the protein composition from one assay preparation 
to another particularly with regard to the subtype. Also, the presence of various impurities 

15 in the assay preparation is a problem in small plate assays. Furthermore, as the protein 
preparation lacks homogeneity, the interaction between the targeted protein and the assay 
plate is often quite uneven. This rendres the streamlining of the assay in a high throughput 
format almost impossible to achieve. 

20 

Summary of the invention 

In the context of the present invention, the inventors have found that it was possible to 
delete a portion of the nucleotide sequence encoding a eukaryotic, preferably a mammal 
25 cerebral cortical voltage-dependent calcium channel 0:2 subunit to yield a soluble secreted 
protein which retains its affinity for [ 3 H]gabapentin. 

The invention concerns: 

30 1) A purified or isolated nucleic acid encoding a mammalian secreted soluble cerebral 
cortical voltage-dependent calcium channel 0.26-2, 0C28-3 or CC2S-4 subunit polypeptide. 

2) A purified or isolated nucleic acid according to 1), comprising a polynucleotide having 
at least 90% identity with the sequence encoding : 



\ 
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- from amino-acid 1 to between amino-acids 1027 and 1062 of SEQ ED N°20 for a.28-2, 

- from amino-acid 1 to between amino-acids 984 and 1019 of SEQ ED N°22 for 0.28-3. 

3) A purified or isolated nucleic acid according to 1), having at least 90% identity with the 
5 sequence encoding : 

- from amino-acid 1 to between amino-acids 1047 and 1062 of SEQ ED N°20 for 028-2, 

- from amino-acid 1 to between amino-acids 1004 and 1019 of SEQ ED N°22 for 0.28-3. 

4) A purified or isolated nucleotide sequence according to 1) wherein said sequence is the 
10 sequence of SEQ ED N°l, SEQ ED N°2, SEQ ID N°3, SEQ ED N°7, SEQ ED N°8, SEQ ED 

N°9, SEQ ED N°13, SEQ ED N°14, SEQ ED N°15, SEQ ED N°19 or SEQ ED N°21. 

5) A purified or isolated nucleic acid, having at least 90% identity with the nucleotide 
sequence of SEQ ID N° 19 or SEQ ED N°21 . 

15 

6) A purified or isolated polynucleotide comprising at least 10 consecutive nucleotides of 
the nucleotide sequence of SEQ ED N° 19 or SEQ ED N°21. 

7) A polynucleotide probe or primer hybridizing, under stringent conditions, with the with 
20 the nucleotide sequence of SEQ ED N° 1 9 or SEQ ED N°2 1 . 

8) A method for the amplification of a nucleic acid encoding a mammalian secreted 
soluble cerebral cortical voltage-dependent calcium channel ct28-n subunit polypeptide 
wherein n is 2, 3 or 4, said method comprising the steps of: 

25 (a) contacting a test sample suspected of containing the target secreted soluble 028- 

n subunit nucleic acid, or a sequence complementary thereto, with an amplification 
reaction reagent comprising a pair of amplification primers located on either side of 
the o^S-n subunit nucleic acid region to be amplified, and 
(b) optionally, detecting the amplification products. 

30 

9) A kit for the amplification of a nucleic acid encoding a secreted soluble a28-n subunit 
polypeptide wherein n is 2, 3 or 4, or a complementary sequence thereto in a test sample, 
wherein said kit comprises: 



> 
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(a) a pair of oligonucleotide primers which can hybridize, under stringent 
conditions, to the secreted soluble cc 2 5-n subunit nucleic acid region to be 
amplified; 

(b) optionally, the reagents necessary for performing the amplification reaction. 

5 

10) A recombinant vector comprising a nucleic acid according to 1). 

1 1) A recombinant host cell comprising a nucleic acid according to 1). 

10 12) A method for producing a secreted soluble a 2 5-n subunit wherein n is 2, 3 or 4, and 
said method comprises the steps of: 

(a) inserting the nucleic acid encoding the desired a 2 5-n subunit polypeptide in an 
appropriate vector; 

(b) culturing, in an appropriate culture medium, a host cell previously transformed 
15 or transfected with the recombinant vector of step (a); 

(c) harvesting the culture medium thus obtained or lyse the host cell, for example 
by sonication or osmotic shock; 

(d) separating or purifying, from said culture medium, or from the pellet of the 
resultant host cell lysate, the thus produced a 2 5-n subunit polypeptide of interest. 

20 

13) A purified or isolated recombinant polypeptide comprising the amino acid sequence of 
a secreted soluble a 2 8-2, a 2 S-3 or a 2 8-4 subunit polypeptide. 

14) A recombinant polypeptide according to 14), having at least 80% amino-acid identity 
25 with a polypeptide comprising : 

- from amino acid 1 to between amino acids 1027 and 1062 of the amino acid sequence 
ofSEQIDN°20, or 

- from amino acid 1 to between amino acids 1019 and 1079 of the amino acid sequence 
of SEQIDN 0 22. 

30 

15) A recombinant polypeptide according to 14), wherein said recombinant polypeptide is 
selected from the group consisting of the amino acid sequences of SEQ ID n°4, SEQ ID 
n°5, SEQ ID n°6, SEQ ID n°10, SEQ ID n°ll, SEQ ID n°12, SEQ ID n°16, SEQ ID n°17 
and SEQ ID n°18. 



5 



16) A method for the screening of ligands which bind a cerebral cortical voltage- 
dependent calcium channel 0C25-n subunit wherein n is 2, 3 or 4, said method comprising 
the steps of: 

5 - contacting a secreted soluble recombinant calcium channel a 2 8-n subunit 

polypeptide with: 

- a ligand of interest; and 

a labelled compound which binds the a 2 8-n subunit; and 
- measuring the level of binding of the labelled compound to the cc 2 5-n subunit. 

0 

17) A method according to 16), wherein said method is a scintillation proximity assay. 



18) A method according to 16), wherein said method is a flashplate assay. 
15 19) A method according to 16), wherein said method is a filter binding assay. 



20) A method according to 16), wherein said secreted soluble recombinant calcium 
channel a 2 8-n subunit polypeptide is selected from polypeptides having at least 80%, 
preferably 90%, more preferably 95%, and most preferably 98 or 99% amino-acid identity 

20 with the polypeptide comprising from amino acid 1 to between amino-acids 984 and 1063, 
preferably between amino-acids 994 and 1054, and most preferably between amino-acids 
1019 and 1054 of SEQ ID N°5 or SEQ ID N°16. 

21) A method according to 16), wherein said secreted soluble recombinant calcium 
25 channel oc 2 5-n subunit polypeptide is selected from the group consisting of SEQ ID N°6, 7, 

8, 9, 13, 14 and 15, with the polypeptides of SEQ ID N°9 and SEQ ID N°15 being most 
preferred. 

22) A kit for the screening of ligands which bind bind a cerebral cortical voltage- 
30 dependent calcium channel a 2 8-n subunit wherein n is 2, 3 or 4, said kit comprising: 

- a secreted soluble recombinant calcium channel a 2 S-n subunit; and 

- a labelled compound which binds to the a 2 5-n subunit. 
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Hence, the invention concerns nucleotide sequence fragments of a cerebral cortical voltage 
dependent calcium channel a 2 8-2, a 2 8-3 or a 2 5-4 subunit cDNA encoding a soluble 
secreted a 2 8-2, a 2 8-3 or a 2 8-4 subunit polypeptide(hereinafter a a 2 8-2, a 2 S-3 or a 2 8-4 
5 subunit)^ Preferably, these nucleotide sequences encode a soluble secreted a 2 8-2, a 2 8-3 or 
a 2 6-4 subunit polypeptide bearing a gabapentin or a [ 3 H]gabapentin binding site. More 
preferably, the soluble secreted a 2 8-2, a 2 8-3 or a 2 S-4 subunit nucleic acid is derived from 
a eukaryotic, preferably a mammal, more preferably a human a 2 S-2, a 2 8-3 or a 2 8-4 
subunit. 

10 

A further object of the present invention concerns recombinant vectors comprising a 
nucleic acid sequence encoding a soluble secreted cc 2 S-2, cc 2 S-3 or a 2 S-4 subunit 
polypeptide. 

The invention also encompasses host cells and transgenic non-human mammals 
1 5 comprising said nucleic acid sequences or recombinant vectors. 

The invention also concerns a soluble secreted a 2 S-2, a 2 S-3 or a 2 8-4 subunit polypeptide 
which is characterized in that it is a soluble secreted polypeptide having affinity for 
[ 3 H]gabapentin. Preferably, the soluble secreted polypeptide is derived from a mammal, 
20 more preferably a human a 2 8-2, a 2 S-3 or a 2 8-4 subunit. 

The inventors have also found that it was possible to use a soluble secreted form of a 
voltage-dependant calcium channel cc 2 S-2, a 2 8-3 or a 2 8-4 subunit polypeptide in an assay 
for the screening of ligands which bind the oc 2 5-2, a 2 5-3 or a 2 5-4 subunit. 

25 

The invention therefore also concerns a method for the screening of ligands which bind a 
calcium channel ot 2 8-2, a 2 8-3 or a 2 S-4 subunit. 
The method comprises the steps of: 

- contacting a secreted soluble recombinant calcium channel a 2 8-2, a 2 8-3 or a 2 8-4 
30 subunit polypeptide with: 

a ligand of interest; and 
- a labelled compound which binds a cc 2 S-2, a 2 8-3 or a 2 8-4 subunit; and 

- measuring the level of binding of the labelled compound to the secreted soluble 
a 2 8-2, a 2 8-3 or a 2 8-4 subunit. 

35 
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The invention also concerns a kit for the screening of ligands which bind a calcium 
channel 0126-2, a 2 8-3 or a 2 S-4 subunit. 
The kit comprises: 

- a secreted soluble recombinant calcium channel a 2 8-2, a 2 5-3 or OC25-4 
5 subunit polypeptide; and 

- a labelled compound which binds a calcium channel a 2 S-2, a 2 8-3 or 
a 2 8-4 subunit. 

10 Detailed description of the invention 

The invention concerns truncated ot 2 8-2, cc 2 8-3 or a 2 S-4 subunit cDNA sequences. These 
truncated sequences encode soluble secreted polypeptides which retain their affinity for 
[ 3 H]gabapentin. 

15 

Throughout the present specification, the expression "nucleotide sequence" is used to 
designate indifferently a polynucleotide or a nucleic acid. More precisely, the expression 
"nucleotide sequence" encompasses the nucleic material and the sequence information and 
is not restricted to the sequence information (i.e. the succession of letters chosen among the 
20 four base letters) that biochemically characterizes a specific DNA or RNA molecule. 

As used interchangeably herein, the terms "oligonucleotides", "nucleic acids" and 
"polynucleotides" include RNA, DNA, or RNA/DNA hybrid sequences of more than one 
nucleotide in either single chain or duplex form. 

Further to its general meaning understood by the one skilled in the art, the term 
25 "nucleotide" is also used herein to encompass modified nucleotides which comprise at 

least one of the following modifications (a) an alternative linking group,. (b) an analogous 

form of purine, (c) an analogous form of pyrimidine, or (d) an analogous sugar. For 

examples of analogous linking groups, purines, pyrimidines, and sugars, see for example 

PCT publication N°WO 95/04064. 
30 The polynucleotide sequences of the invention may be prepared by any know method, 

including synthetic, recombinant, or a combination thereof as well as through any 

purification methods known in the art. 

A) Secreted a28-2, a25-3 or a.28-4 subunit polypeptides 

35 The invention comprises polynucleotide sequences encoding a soluble secreted eukaryotic, 
preferably a soluble secreted mammal a 2 6-2, a 2 8-3 or a 2 8-4 subunit polypeptide. These 
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sequences particularly include but are not restricted to 1) those sequences encoding a 
soluble secreted polypeptide of this 0:26-2, a 2 S-3 or a 2 S-4 subunit which preferably retains 
its binding affinity for [ 3 H]gabapentin and 2) nucleotide fragments useful as nucleic acid 
primers or probes for amplification or detection purposes. 
5 The expression "soluble secreted a 2 8-2, a 2 8-3 or a 2 8-4 subunit" is intended to designate 
polypeptide sequences which, when produced by a recombinant host cell, are secreted at 
least partially into the culture medium rather than remaining associated with the host cell 
membrane. 

10 1) cDNA fragments encoding soluble secreted a?8-2, aaS-3, 0^8-4 subunit 
polypeptides 

One of the important embodiments of the present invention concerns truncated nucleotide 
sequences of a 2 8-2, a 2 8-3 or a 2 8-4 subunit cDNAs which encode soluble secreted a 2 8-2, 
a 2 8-3 or a 2 S-4 subunit polypeptides. The inventors have found that it was possible to 

1 5 generate deletion mutants of a 2 8-2, a 2 8-3 or a 2 8-4 subunit cDNAs which, when expressed, 
produce a significant amount of soluble secreted proteins, preferably soluble secreted 
proteins, which retain their [ 3 H]gabapentin binding affinity. These truncated nucleotide 
sequences of the invention are of significant value to the skilled person as they now allow 
fast and reliable access to significant concentrations of selected soluble secreted a 2 8-2, 

20 a 2 8-3 or a 2 S-4 subunit polypeptides. To that end, the inventors have determined the 
minimal and optimal fragment lengths required to express a polypeptide which: 1) binds 
[ 3 H]gabapentin with sufficient affinity and; 2) is obtained in a soluble secreted form. 
The discussion provided below provides comments on possible truncations, giving as an 
example the human a 2 8-2, a 2 8-3 or a 2 8-4 subunit. However, given the very substantial 

25 cross-species homology for ct 2 8-2, cc 2 S-3 or a 2 8-4 subunit sequences, the comments below 
can also be applied to other eukaryotic species, and more particularly other mammalian 
species such as rat, mouse, rabbit or pig. Their a 2 5-2, a 2 S-3 or a 2 8-4 subunit sequences, 
which for most are available in public databases, share a very substantial homology with 
the human oc 2 S-2, oc 2 S-3 or a 2 8-4 subunit sequences. 

30 

The inventors believe that the soluble secreted a 2 5-2 subunit polypeptides which are as 
close as possible to the native sequence and which are therefore more likely to retain their 
native folding and hence their [ 3 H]gabapentin binding properties are those corresponding 

to the native protein in which amino-acid stretch 1027 to the C-terminal end of 

35 the amino-acid sequence of SEQ ID N°20 has been deleted. The skilled scientist can quite 
easily determine within this amino-acid stretch the optimal oc 2 S-2 subunit polypeptides. 

The inventors also believe that the soluble secreted a 2 8-3 subunit_polypeptides which are 
as close as possible to the native sequence and which are therefore more likely to retain 
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their native folding and hence thir [ 3 H]gabapentin binding properties are those 

corresponding to the native protein in which amino-acid stretch 984 to _C-terminal 

end of the amino-acid sequence of SEQ ID N°22 has been deleted. The skilled scientist 

can quite easily determine within this amino-acid stretch the optimal a.28-3 subunit 
5 polypeptides. 

The invention therefore particularly concerns a nucleotide sequence encoding a 
polypeptide having at least 80% identity with the polypeptide comprising from amino-acid 

1 to between amino-acids _1027 and _1145 , preferably to between amino-acids 

10 _1062_ and _1 145 of SEQ ID N°20. 

Preferred nucleotide sequences include those of SEQ ID N°l, SEQ ID N° 2 and SEQ ID 
N°3. 

The invention also concerns a nucleotide sequence encoding a polypeptide having at least 
15 80% identity with the polypeptide comprising from amino-acid 1 to between amino-acids 

984_ and 1085 , preferably tobetween amino-acids 1019 and _1085 of SEQ ID 

N°22. 

Preferred nucleotide sequences include those of SEQ ID N°7, SEQ ID N° 8 and SEQ ID 

N°9. 

20 

B) Amplification of the soluble secreted 0,28-2, ct 2 8-3 or 0.78-4 subunit nucleotide 
sequences 

Another object of the invention consists of a method for the amplification of a nucleic acid 
encoding a soluble secreted a 2 S-2, a 2 5-3 or a 2 5-4 subunit polypeptide, preferably a 
25 polypeptide bearing a [ 3 H]gabapentin binding site, said method comprising the steps of: 

(a) contacting a test sample suspected of containing the target ct 2 5-2, a 2 S-3 or a 2 8-4 
subunit nucleic acid, a fragment or a variant thereof, or a sequence complementary thereto, 
with an amplification reaction reagent comprising a pair of amplification primers which 
can hybridize under stringent conditions, the a 2 S-2, a 2 8-3 or a 2 5-4 subunit nucleic acid 

30 region to be amplified, and 

(b) optionally, detecting the amplification products. 

The expression [ 3 H]gabapentin binding site, when used herein is intented to designate a 
site which can bind either [ 3 H]gabapentin or other ligands such as (S+)-3-isobutyl gaba or 
(R-)-3-isobutyl gaba. 

35 In a first preferred embodiment of the above method, the nucleic acid encodes a secreted 
soluble ct 2 8-2, a 2 8-3 or a 2 8-4 subunit polypeptide of SEQ ID N°4, SEQ ID N°5, SEQ ID 
N°6, SEQ ID N°10, SEQ ID N°ll, SEQ ID N°12, SEQ ID N°16, SEQ ID N°17 and SEQ 
IDn 0 18. 
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In a second preferred embodiment of the above amplification method, the amplification 
product is detected by hybridization with a labelled probe having a sequence which is 
complementary to the amplified region. 

5 

C) Recombinant vectors and hosts cells for the expression of a secreted soluble 0^5-2. 
ai8-3 or 0:26-4 subunit polypeptide 

1) Recombinant vectors 

10 The present invention also encompasses a family of recombinant vectors comprising any 
one of the nucleic acids described herein. Firstly, the invention deals with a recombinant 
vector comprising a nucleic acid selected from the group consisting of: 

(a) a purified or isolated nucleic acid encoding a a secreted soluble <X2&-2, OC2S-3 or 
a 2 5-4 subunit having at least 80% amino acid identity with the polypeptide of SEQ ID 

15 N°20 or 22, or a sequence complementary thereto; 

(b) a purified or isolated nucleic acid having at least 90% nucleotide identity with a 
polynucleotide selected from the group consisting of the nucleotide sequences of SEQ ID 
N°l, SEQ ID N°2, SEQ ID N°3, SEQ ID No 7, SEQ ID N°8, SEQ ID N°9, SEQ ID No 13, 
SEQ ID N°14, SEQ ID N° 15 or a sequence complementary thereto; 

20 ( c ) a purified or isolated polynucleotide comprising at least 10 consecutive 

nucleotides of a nucleic acid described in (a) or (b) or a sequence complementary thereto. 

In a first preferred embodiment a recombinant vector of the invention is used to amplify 
the inserted polynucleotide of the invention in a suitable host cell, this polynucleotide 

25 being amplified every time the recombinant vector replicates. 

Recombinant expression vectors comprising a nucleic acid encoding secreted soluble 0,26- 
2, a 2 5-3 or ct25-4 subunit polypeptides that are described in the present specification are 
also part of the invention. These include, but are not restricted to, nucleic acids encoding 
from amino-acid 1 to between amino-acids 1027 and 1145, preferably between amino- 

30 acids 1062 and 1145 of SEQ ID N°20, as well as nucleic acids encoding from amino-acid 
1 to between amino-acids 984 andl085, preferably between amino-acids 1019 and 1085, of 
SEQ ID N°22. 

Another preferred embodiment of the recombinant vectors according to the invention 
consist of expression vectors comprising a nucleic acid encoding a 2 5-2, CC25-3 or a 2 6-4 
35 subunit polypeptides of the invention, and more preferably a nucleic acid encoding a 
polypeptide selected from the group consisting of the amino acid sequences of SEQ ID 
N°4, SEQ ID N°5, SEQ ID N°6, SEQ ID N°10, SEQ ID N°ll, SEQ ID N°12, SEQ ID 
N°16, SEQ ID N°17 and SEQ ID n°18. 
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Within certain embodiments, expression vectors can be employed to express the secreted 
soluble CC28-2, 028-3 or 02S-4 subunit polypeptides which can then be purified and for 
example, be used as a immunogen in order to raise specific antibodies directed against said 
secreted soluble 0628-2, OC25-3 or a 2 5-4 subunit polypeptides. 
5 Preferred eukaryotic vectors of the invention are listed hereafter as illustrative but not 
limitative examples: pcDNA3, pFLAG, pCMV-Script, pIND, pMClNEO, pHIL, 
pGAPZA, pMT/V5-His-TOPO, pMT/V5-His, pAc5.1/V5-HisA, pDS47/V5-His, pcDNA4, 
pcDNA6, pEFl, pEF4, pEF6, pUB6, pZeoSV2, pRc/CMv2, pcDM8, pCR3.1, pDisplay, 
pSecTag2, pVP22, pEMZ, pCMV/Zeo, pSinRep5, pCEP, pREP, pHook-1 

10 

Preferred bacteriophage recombinant vectors of the invention are PI bacteriophage vectors 
such as described by Sternberg N.L. (1992;1994). 

A suitable vector for the expression of a soluble secreted 028-2, 0C28-3 or 0,28-4 subunit 
polypeptide is a baculovirus vector that can be propagated in insect cells and in insect cell- 
15 lines. Specific suitable host vectors includes, but are not restricted to :pFastBac-l, pIZ/V5- 
His, pBacMan-1, pBlueBac4.5, pBlueBacHis2, pMelBacA, pVL1392, pVL1393 

The recombinant expression vectors from the invention may also be derived from an 
adenovirus such as those described by Feldman and Steig. (1996) or Ohno et al. (1994). 
20 Another preferred recombinant adenovirus according to this specific embodiment of the 
present invention is the human adenovirus type two or five (Ad 2 or Ad 5) or an 
adenovirus of animal origin (French Patent Application n°FR 93 05 954). 

a) Re2uIatory expression sequences 

25 Expression requires that appropriate signals are provided in the vectors, said signals 
including various regulatory elements such as enhancers/promoters from both viral and 
mammalian sources that drive expression of the genes of interest in host cells. The 
regulatory sequences of the expression vectors of the invention are operably linked to the 
nucleic acid a soluble secreted 0.26-2, a 2 8-3 or a 2 8-4 subunit polypeptide. 

30 As used herein, the term "operably linked" refers to a linkage of polynucleotide elements 
in a functional relationship. For instance, a promoter or an enhancer is operably linked to a 
coding sequence if it affects the transcription of the coding sequence. 
More precisely, two DNA molecules (such as a polynucleotide containing a promoter 
region and a polynucleotide encoding a desired polypeptide or polynucleotide) are said to 

35 be "operably linked" if the nature of the linkage between the two polynucleotides does 
not : (1) result in the introduction of a frame-shift mutation or (2) interfere with the ability 
of the polynucleotide containing the promoter to direct the transcription of the coding 
polynucleotide. 

Generally, recombinant expression vectors include origins of replication, selectable 
40 markers permitting transformation of the host cell, and a promoter derived from a highly 
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expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in an appropriate frame with the translation, 
initiation and termination sequences, and preferably a leader sequence capable of directing 
sequences of the translated protein into the periplasmic space or the extra-cellular medium. 

"5 In a specific embodiment wherein the vector is adapted for transfecting and expressing 
desired sequences in eukaryotic host cells, preferred vectors comprise an origin of 
replication from the desired host, a suitable promoter and an enhancer, and also any 
necessary ribosome binding sites, polyadenylation site, transcriptional termination 
sequences, and optionally 5 '-flanking non-transcribed sequences. 

10 DNA sequences derived from the SV 40 viral genome, for example SV 40 origin early 
promoter, enhancer, and polyadenylation sites may be used to provide the required non- 
transcribed genetic elements. 

b) Promoter sequences 

15 Suitable promoter regions used in the expression vectors according to the invention are 
chosen taking into account the host cell in which the heterologous nucleic acids have to be 
expressed. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
controls the expression, or alternatively can be endogenous to the native polynucleotide 
20 containing the coding sequence to be expressed. 

Additionally, the promoter is generally heterologous with respect to the recombinant vector 
sequences within which the construct promoter/coding sequence has been inserted. 
Preferred eukaryotic promoters are the CMV, polyhidran or OPIE2. 

25 c) Recombinant host cells 

Host cells that have been transformed or transfected with one of the nucleic acids described 
herein, or with one of the recombinant vector, particularly recombinant expression vector, 
described herein are also part of the present invention. 

Are included host cells that are transformed (prokaryotic cells) or are transfected 
30 (eukaryotic cells) with a recombinant vector such as one of those described above. 
Preferred host cells used as recipients for the expression vectors of the invention are the 
following: 

(a) prokaryotic host cells: Escherichia coli, strains, (i.e. DH10 Bac strain) Bacillus 
subtilis, Salmonella typhimurium and strains from species such as Vseudomonas, 

3 5 Streptomyces and Staphylococcus; 

(b) eukaryotic host cells: HeLa cells (ATCC N°CCL2; N°CCL2.1; N°CCL2.2), Cv 
1 cells (ATCC N°CCL70), COS cells (ATCC N°CRL 1650; N°CRL 1651), Sf-9 cells 
(ATCC N°CRL 171 1), C127 cells (ATCC N°CRL-1804), 3T3 cells (ATCC N°CRL-6361), 
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CHO cells (ATCC N°CCL-61), human kidney 293 cells (ATCC N° 45504; N°CRL-1573), 
BHK (ECACC N°84100 501; N°841 11301), sf 9, sf 21 and hi-5 cells. 

D) Production of recombinant secreted soluble 0:28-2, agS-S or oc?8-4 subunit 
5 polypeptides 

The present invention also concerns a method for producing one of the amino acid 
sequences described herein and especially a polypeptide selected from the group consisting 
of the aminoacid sequences of SEQ ID N°4, SEQ ID N°5, SEQ ID N°6, SEQ ID N°10, 
10 SEQ ID n°ll, SEQ ID n°12, SEQ ID n°16, SEQ ID n°17or SEQ ID n°18 wherein said 
method comprises the steps of: 

(a) inserting the nucleic acid encoding the desired amino acid sequence in an 
appropriate vector; 

(b) culturing, in an appropriate culture medium, a host cell previously transformed 
15 or transfected with the recombinant vector of step (a); 

(c) harvesting the culture medium thus obtained or lyse the host cell, for example 

by sonication or osmotic shock; 

(d) separating or purifying, from said culture medium, or from the pellet of the 

resultant host cell lysate, the thus produced recombinant polypeptide of interest. 

20 

In some instances, it is required to tag the secreted soluble CC2S-2, CC2S-3 or a.28-4 subunit 
polypeptide prior to purification. The tag is then in most instances encoded into the 
nucleotide sequence that is need to express the polypeptide. Examples of such tags include, 
but are not limited to sequences encoding C-myc, FLAG, a sequence of histidine residues, 

25 heamaglutin A, V5, Xpress or GST. Most of these tags can be incorporated directly into 
the sequence, for instance through PCR amplification by incorporating the appropriate 
coding sequence in one of the PCR amplification primers. However, the tag can also be 
introduced by other means such as covalent binding of the appropriate nucleic acid 
sequence encoding the tag moiety with the 3' or 5' end of the nucleic acid sequence 

30 encoding the polypeptide sequence. This is the case for GST. 

Purification of the recombinant secreted soluble 0:26-2, CC28-3, 0:26-4 subunit polypeptides 
according to the present invention is then carried out by passage onto a nickel or copper 
affinity chromatography column, such as a Ni NTA column or a Q-Sepharose column. 
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In another embodiment of the above method, the polypeptide thus produced is further 
characterized, for example by binding onto an immuno-affmity chromatography column on 
which polyclonal or monoclonal antibodies directed to the secreted soluble OC25-2 subunit 
polypeptide of interest have been previously immobilised. 

5 

In another embodiment of the invention, the secreted soluble 0C2S-2, a.25-3, a.25-4 subunit 
polypeptide can be only partially purified. For instance, it can be purified along with other 
contaminating proteins using an appropriate chromatography matrix such as an ion- 
exchange chromatography column. In such instances, it is not required to tag the desired 
1 0 polypeptide of interest. 

The most preferred embodiment contemplated by the inventors concerns the use of a 
purified tagged secreted soluble ct 2 5-2 , a 2 8-3 or a2§-4 subunit polypeptide. A particularly 
preferred tag is a nucleotide sequence encoding from 2 to 10, and preferably 6 histidine 
1 5 residues. 

With regard to the secreted soluble ct25-2 , 0:28-3 or ct25-4 subunit polypeptide used 
subsequently in the screening assay of the invention, several possibilities are also open to 
the skilled person. 

20 

In a first and preferred embodiment, the secreted soluble a.28-2, a.28-3 or ot2S-4 subunit 
polypeptide comprises a tag moiety which can be selected among the tags referred to 
above. Such tagged polypeptides are particularly useful in SPA or flashplate assays. A 
preferred tag is the nucleotide sequence encoding histidine residues referred to above. 

25 

In a second embodiment, the secreted soluble 0:28-2, 0:28-3 or subunit polypeptide 

can be used without a tag. This is the case for instance in SPA or flashplate assays 
comprising beads or plates coated with wheat germ lectin. In such an embodiment, the tag 
is not needed as the carbohydrate moieties of the secreted soluble 0:28-2, a.25-3 or 0.28-4 
30 subunit polypeptide bind directly to the wheat germ lectin-coated beads or plates. 
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1) Labelled compounds which bind the secreted soluble ot2S-2, a 7 8-3 or a?5-4 
subunit polypeptide 

In cases where the ot25-2, a 2 5-3 or a 2 5-4 binding site is the [ 3 H]gabapentin binding site, 
the preferred labelled compound which can be used is of course gabapentin itself. 
5 However, gabapentin is not the only labelled compound which can be used in this context. 
Indeed, it has been previously demonstrated that saturation binding analyses on porcine 
synaptic plasma cerebral cortex membranes performed in the presence of L-leucine 
indicate a competitive interaction of the amino acid with the [ 3 H]gabapentin binding site, 
significantly reducing [ 3 H]gabapentin binding affinity for the site. The inventors believe 
10 that this competitive interaction is true across across all the amino-acids listed in table 1 
below. 



15 Table 1 

Binding affinities of selected amino acids (IC50 <500nM) for the [ 3 H] gabapentin site in 
porcine cortical membranes 





Compound 


ICsn (NM) ARITHMETIC MEAN (N=3) ± S.E.M. 


20 


Gabapentin 


42.1 + 5.5 




L-Norleucine 


23.6 + 6.7 




L- Al lo-Isoleucine 


32.8 + 6.0 




L-Methionine 


49.6 + 10.0 




L-Leucine 


61.3 ±20.9 


25 


L-Isoleucine 


68.8 ± 1.9 




L-Valine 


330 + 18 




L-Phenylalanine 


351 + 89 



It is therefore possible to use commercialy available labelled forms of these high affinity 
30 ligands in replacement of gabapentin. The utility of [ 3 H]L-leucine has been demonstrated 
in a filter binding assay and in a flashplate assay format. The inventors believe that labelled 
amino acids but also other compounds, with affinities preferably below 500 nM in the 
binding assay can be used as replacements of gabapentin. 
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With regard to the label, several embodiments can be used in the context of the invention. 
Preferred labels are of course radioactive labels, a list of which is provided further in this 
specification. 

5 

2) Assay formats and conditions 

Several assay formats can be used to carry out the method of the present invention. 

Preferred assay formats include scintillation assays such as the scintillation proximity 
assay (SPA) or the flashplate assay. Other assay formats well known to those skilled in the 
10 arts such as the filter binding assay and the centrifugation assay are also contemplated in 
the present invention. 

SPA and flashplate assays are preferred assay formats for the present invention. Additional 
details on these assays are provided below. 

15 Scintillation assay format 

Scintillation assays technology either involves the use of scintillant beads (for the SPA 

assay) or plates (for the flashplate assay). SPA beads are usually made from either cerium- 
doped yttrium ion silicate (y2SiOS:Ce) or polyvinyltoluene (PVT) containing an organic 
scintillant such as PPO. Flashplates commonly used are those such as Ni chelate 
20 flashplates although other flashplates can also be used. 

Assays are usually carried out in aqueous buffers using radioisotopes such as 3 H, 125 1, 14 c, 
35 S or 33 P that emit low-energy radiation, the energy of which is easily dissipated in an 
aqueous environment. For example, the electrons emitted b y 3 H have an average energy 

25 of only 6 keV and have a very short path length (-1 ~tm) in water. If a molecule labelled 
with one of these isotopes is bound to the bead or flashplate surface, either directly or via 
interaction with another molecule previously coupled to the bead or flashplate, the emitted 
radiation will activate the scintillant and produce light. The amount of light produced, 
which is proportional to the amount of labelled molecules bound to the beads, can be 

30 measured conveniently with a liquid scintillation (LS) counter. If the labelled molecule is 
not attached to the bead or a flashplate surface, its radiation energy is absorbed by the 
surrounding aqueous solvent before it reaches the bead, and no light is produced. Thus, 
bound ligands give a scintillation signal, but free ligands do not, and the need for atime- 
consurning separation step, characteristic of conventional radioligand binding assays, is 
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eliminated. The manipulations required in the assays are reduced to a few simple pipetting 
steps leading to better precision and reproducibility. 

The conditions under which SPA and flashplate assays are performed in the context of the 
present invention are provided below. 

5 

Scintillation assay conditions 
a) SPA assay 

The SPA assays is first developed to optimize the conditions under which the radioligand 
binds the secreted soluble CC28-2, ot25-3 or ot25-4 subunit polypeptide. The parameters 
10 which can be varied to optimize radioligand binding in a typical SPA assay using 
Amersham beads include assay temperature, 0:28-2, 0:26-3 or ct25-4 subunit polypeptide 
interaction with the radioligand and the SPA beads, radioligand concentration as well as 
pH variations. 

15 The temperature at which the assay can be carried out can vary from 1 to 30°C. Preferred 
temperatures range from 18 to 23 °C, with 21°C being the most preferred temperature. The 
interaction of the 0128 subunit polypeptide with the SPA beads can be optimized by 
adjusting the concentration of the polypeptide and by introducing a reagent which will 
favor this interaction. When 50 mg of Amersham SPA beads are used, the 0:28-1 subunit 

20 polypeptide concentration may vary from 0.1 to 10 pmoles per well, with the optimal 
concentration being generally around 5 to 6 pmoles per well. 

As for the reagent favoring the interaction between the secreted soluble a.28-2, 0:28-3 or 
0:28-4 subunit polypeptide and the radioligand as well as the Amersham SPA beads, the 
25 inventors found that imidazole could be efficiently used for that purpose when the OC2S-2, 
012S-3 or ct2S-4 subunit polypeptide is tagged with an amino acid sequence including 6 
histidine residues. Furthermore, and more importantly, it was found that imidazole also 
enhanced binding of the radioligand to the ct2S-2, a 2 8-3 or oc 2 5-4 polypeptide. 

30 The concentration of the radioligand is evaluated with respect to the concentration of 
secreted soluble ot 2 S-2, a 2 8-3 or a 2 8-4 subunit polypeptide present in the assay medium. 
Generally, the concentration of radioligand varies from 1 nM to 100 nM. A preferred 
[ 3 H]gabapentin concentration is about 5 to 20 nM, with a most preferred concentration 
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being about 10 nM. A preferred [ 3 H]leucine concentration is also about 5 to 20 nM, with a 
most preferred concentration being about 10 nM. It is to be noted that the concentration of 
other radioligands having affinities similar to those of [ 3 H]gabapentin and [ 3 H]leucine 
should also be in the range of about 5 to 20 nM. 

5 

Once the optimal radioligand binding conditions have been determined, a test ligand can be 
introduced in the assay medium to evaluate the level of displacement of the radioligand. 
The concentration of test ligand to be introduced in the assay medium usually varies from 
0.1 nM to about 100 uM. A preferred test ligand concentration of about 10 uM is usually a 
10 starting point in a high throughput screening assay. Then, depending on the number of hits 
obtained, it may be lowered or increased. 

It is to be noted that the parameters set forth above, which have been evaluated for a 
typical SPA assay using Amersham SPA beads can be adjusted by the skilled person, for 
1 5 example if SPA beads of a different type are used. 

b) Flashplate assay 

Similarly to the SPA assays, the flashplate can first be developed in order to optimize the 
conditions under which the radioligand binds the a 2 8 subunit polypeptide. The parameters 
20 which can be varied to optimize radioligand binding in a typical flashplate assay using 
NEN Ni chelate flashplates also include assay temperature, secreted soluble 0,26-2, 0,28-3 
or ot25-4 subunit polypeptide interaction with both the radioligand and the flashplates, 
radioligand concentration as well as pH variations. 

25 The temperature at which the assay can be carried out can vary from 1 to 30°C. Preferred 
temperatures range from 18 to 23°C, with 21°C being the most preferred temperature. 

The interaction of the secreted soluble a 2 8-2, a 2 8-3 or a 2 5-4 subunit polypeptide with the 
flashplates can be optimized by adjusting the concentration of the polypeptide and by 
30 introducing a reagent which will favor this interaction. When a standard NEN Ni chelate 
flashplate is used, the secreted soluble a 2 5-2, a 2 8-3 or a 2 5-4 subunit polypeptide volume 
usually varies between 0.5 and 20 ul for a concentration of secreted soluble a 2 5-2, a 2 5-3 or 
a 2 5-4 subunit polypeptide of 0.6 pmol/ul. As the published maximum binding capacity of 
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NEN p plates is about 6 pmol per well, the inventors consider that an optimal concentration 
of secreted soluble 028-2, 0:28-3 or 028-4 subunit polypeptide is probably around 5 pmol 
per well at 8 ul. 

5 With regard to the reagent favoring the interaction between the secreted soluble 0:28-2, 
a 2 8-3 or oc 2 8-4 subunit polypeptide and the radioligand as well as the flashplates, the 
inventors believe that imidazole could also be efficiently used for that purpose when the 
secreted soluble 028-2, OC28-3 or 0,28-4 subunit polypeptide is tagged with an amino acid 
sequence including 6 histidine residues. The inventors also believe that imidazole 

10 concentrations can substantially enhanced binding of the radioligand to the secreted 
soluble a 2 8-2, a 2 8-3 or 028-4 polypeptide. The optimal concentration of imidazole used to 
enhance radioligand binding varies depending on the concentration of secreted soluble 
028-2, 028-3 or a 2 8-4 subunit polypeptide used in the assay. For instance, when the 
volume of the 028-1 subunit polypeptide is about 10 ul ul (028-2, a 2 8-3 or 028-4 

15 polypeptide concentration of 0.6 pmol/ul), the optimal imidazole concentration can vary 
between 1 and 20 mM, with a concentration of about 10 raM being preferred. As 
mentioned previously, other compounds such as histidine as well as pH variations may be 
used to enhance radioligand binding. 

20 The concentration of the radioligand is evaluated with respect to the concentration of 028- 
2, a 2 8-3 or a 2 S-4 subunit polypeptide present in the assay medium. Generally, the 
concentration of radioligand varies from 1 nM to 100 nM. A preferred [ 3 H]gabapentin 
concentration is about 5 to 20 nM, with a most preferred concentration being about 10 nM. 
A preferred [ 3 H]leucine concentration is also about 5 to 20 nM, with a most preferred 

25 concentration being about 10 nM. It is to be noted that the concentration of other 
radioligands having affinities similar to those of [ 3 H]gabapentin and [ 3 H]leucine should 
also be in the range of about 5 to 20 nM. 

Once the optimal radioligand binding conditions have been determined, a test ligand can be 
30 introduced in the assay medium to evaluate the level of displacement of the radioligand. 
The concentration of test ligand to be introduced in the assay medium usually varies from 
0.1 nM to about 100 uM. A preferred test ligand concentration of about 10 uM is usually a 
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starting point in a high throughput screening assay. Then, depending on the number of hits 
obtained, it may be lowered or increased. 

The inventors have tested the displacement of a particular radioligand, [ 3 H]gabapentin, 
5 with (S+)-3-isobutly gaba. The data provided in the examples which follow clearly shows 
that the assay can be used in high throughput competition studies. 

E) Purified recombinant secreted soluble ai5-2, ard-3 or ag5-4 polypeptides 

10 Another object of the present invention consists of a purified or isolated recombinant 
polypeptide comprising the amino acid sequence of a secreted soluble 028-2, ot28-3 or OC28- 
4 subunit polypeptide. 

Preferred isolated recombinant polypeptides of the invention include those having at least 
15 80%, preferably 90%, more preferably 95, and most preferably 98 or 99%, amino-acid 
identity with polypeptides comprising from amino acid 1 to between amino-acids 1027 and 
1145, preferably between amino-acids 1062 and 1145 of SEQ ID N°20, as well as those 
having at least 80%, preferably 90%, more preferably 95, and most preferably 98 or 99%, 
amino-acid identity with polypeptides comprising from amino acid 1 to between amino- 
20 acids 984 and 1085, preferably between amino-acids 1019 and 1085 of SEQ ID N°22. 

In a further preferred embodiment, the polypeptide comprises an amino acid sequence 
having at least 80%, preferably 90%, more preferably 95%, and most preferably 98% or 
99% amino acid identity with the amino acid sequence of SEQ ID N°4, SEQ ID N°5, SEQ 
25 ID N°6, SEQ ID N°10, SEQ ID N°ll, SEQ ID N°12, SEQ ID N°16, SEQ ID N°17 and 
SEQIDN 0 19. 

F) Modified secreted soluble 0.26-2, ai5-3 or agS-4 subunit polypeptides 

30 The invention also relates to secreted soluble 028-2, 028-3 or 028-4 subunit polypeptide 
comprising amino acid changes ranging from 1, 2, 3, 4, 5, 10, 20, 25, 30, 35, 40 
substitutions, additions or deletions of one amino acid as regards to polypeptides of anyone 
of the amino acid sequences of the present invention. Preferred sequences are those of SEQ 
ID N°4, SEQ ID N°5, SEQ ID N°6, SEQ ID N°10, SEQ ID N°l 1, SEQ ID n°12, SEQ ID 

35 n°16, SEQ ID n°17 and SEQ ID N°18. 

In the case of an amino acid substitution in the amino acid sequence of a polypeptide 
according to the invention, one or several consecutive or non-consecutive amino-acids are 
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replaced by "equivalent" amino-acids. The expression "equivalent" amino acid is used 
herein to designate any amino acid that may be substituted for one of the amino-acids 
belonging to the native protein structure without decreasing the binding properties of the 
corresponding peptides to the antibodies raised against the polypeptides of the invention. 
5 In other words, the "equivalent" amino-acids are those which allow the generation or the 
synthesis of a polypeptide with a modified sequence when compared to the amino acid 
sequence of the secreted soluble ct 2 8-2, a 2 8-3 or a 2 8-4 subunit polypeptides of interest, 
said modified polypeptide being able to bind to the antibodies raised against the secreted 
soluble 0:28-2, 0128-3 or CC2S-4 subunit polypeptide of interest and/or to induce antibodies 

10 recognizing the parent polypeptide. 

Alternatively, amino acid changes encompassed are those which will not abolish the 
biological activity of the resulting modified polypeptide. These equivalent amino-acids 
may be determined either by their structural homology with the initial amino-acids to be 
replaced, by the similarity of their net charge or of their hydrophobicity, and optionally by 

15 the results of the cross-immunogenicity between the parent peptides and their modified 
counterparts. 

The peptides containing one or several "equivalent" amino-acids must retain their 
specificity and affinity properties to the biological targets of the parent protein, as it can be 
assessed by a ligand binding assay or an ELISA assay. 

20 Examples of amino-acids belonging to specific classes include Acidic (Asp, Glu), Basic 
(Lys, Arg, His), Non-polar (Ala, Val, Leu, He, Pro, Met, Phe, Trp) or uncharged Polar 
(Gly, Seu, Thr, lys, Tyr, Asn, Gin) amino-acids. 

Preferably, a substitution of an amino acid in OC2S-2, 0.26-3 or a 2 S-4 subunit polypeptide 
of the invention, or in a peptide fragment thereof, consists in the replacement of an amino 

25 acid of a particular class for another amino acid belonging to the same class. 

By an equivalent amino acid according to the present invention is also contemplated the 
replacement of a residue in the L-form by a residue in the D form or the replacement of a 
Glutamic acid (E) residue by a Pyro-glutamic acid compound. The synthesis of peptides 
containing at least one residue in the D-form is, for example, described by Koch (1977). 

30 A specific embodiment of a modified peptide of interest according to the present invention, 
includes, but is not limited to, a peptide molecule, which is resistant to proteolysis. This is 
a peptide in which the -CONH- peptide bond is modified and replaced by a (CH 2 NH) 
reduced bond, a (NHCO) retro inverso bond, a (CH 2 -0) methylene-oxy bond, a (CH 2 S) 
thiomethylene bond, a (CH 2 CH 2 ) carba bond, a (CO-CH 2 ) cetomethylene bond, a (CHOH- 

35 CH 2 ) hydroxyethylene bond), a (N-N) bound, a E-alcene bond or also a -CH=CH-bond. 

The invention also encompasses secreted soluble a 2 8-2, a 2 8-3 or a 2 S-4 subunit 
polypeptide in which at least one peptide bond has been modified as described above. 
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The polypeptides according to the invention may also be prepared by the conventional 
methods of chemical synthesis, either in a homogenous solution or in solid phase. As an 
illustrative embodiment of such chemical polypeptide synthesis techniques, it may be cited 
the homogenous solution technique described by Houbenweyl (1974). 
5 The secreted soluble ot2S-2, OC25-3 or a.28-4 subunit polypeptide of interest, or a fragment 
thereof may thus be prepared by chemical synthesis in liquid or solid phase by successive 
couplings of the different amino acid residues to be incorporated (from the N-terminal end 
to the C-terminal end in liquid phase, or from the C-terminal end to the N-terminal end in 
solid phase) wherein the N-terminal ends and the reactive side chains are previously 
1 0 blocked by conventional groups. 

For solid phase synthesis, the technique described by Merrifield (1965a; 1965b) may be 
used in particular. 

G) Antibody production 

15 

The secreted soluble a.28-2, a.25-3 or OC28-4 subunit polypeptides of the invention and their 
peptide fragments of interest can be used for the preparation of antibodies. 
Polyclonal antibodies may be prepared by immunization of a mammal, especially a mouse 
or a rabbit, with a polypeptide according to the invention that is combined with an adjuvant 
20 of immunity, and then by purifying the specific antibodies contained in the serum of the 
immunized animal on an affinity chromatography column on which has previously been 
immobilized the polypeptide that has been used as the antigen. 

Monoclonal antibodies may be prepared from hybridomas according to the technique 
described by Kohler and Milstein (1975). 
25 The present invention also deals with antibodies produced by the trioma technique and by 
the human B-cell hybridoma technique, such as described by Kozbor et al. (1983). 
Antibodies of the invention also include chimeric single chain Fv antibody fragments (US 
Patent N° 4,946,778; Martineau et al., (1998), antibody fragments obtained through phage 
display libraries Ridder et al. (1995) and humanized antibodies (Leger et al., (1997)). 

30 

ID Screening assays 

The invention concerns a method for the screening of ligands which bind soluble secreted 
ot25-2, 0126-3 or 0,26-4 subunit polypeptide. More particularly, the targeted a.28-2, a-25-3 or 
ot2S-4 subunit binding site is preferably the [ 3 H]gabapentin binding site. The various 
35 parameters of the method of the invention are described in further detail below. 
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EXAMPLES 
5 Example 1 

Construction of a nucleotide sequence encoding a soluble secreted human a.2§-2 
subunit polypeptide deletion mutant of SEP ID N°23 

a) Primer design 

10 PCR primers were designed to generate the secreted soluble human a.28-2 deletion mutant 
of SEQ ID N° 23 as follows: 

5' PCR primer: This was designed to engineer in a KOZAK translation initiation 
consensus sequence prior to the coding sequence (KozakJBC 266 19867-19870) 
3 ' PCR primer: This was designed to engineer in six histidine residues followed by a stop- 
15 codon at the desired location in the coding sequence. In addition to the stop codon the a.28- 

2 primers also included unEco RI restriction site. 

The bold region in each primer sequence denotes the 'tagged' region; addition of 
sequences not present in the template. Primers were custom synthesized by Perkin Elmer 
20 Applied Biosystems UK to the ABI ready pure grade, supplied lyophilized then 

resuspended to 15uMin lOmM TE. JB197 and 198 were provided with 5' phosphate 
groups: 

5' Primer JB197 (5' -tcgccaccatggcggtgccggctc-3' ,seq id n°25) 

25 

3' Primer JB198 (5' -tcggaattcctcagtgatggtgatggtgatgggccccgcggccacagtc-3' , seq id 

N°28) 

b) Protocol for PCR mediated 5' Kozak and 3' 6His tagging of human 0,28-2 

30 

The full length human 0:28-2 gene (Gen Bank Accession Number AF042792) in a pcDNA 

3 vector as described in Brown, J.P. and Gee,N.S., Cloning and deletion mutagenesis of the 
a2S calcium channel subunit from porcine cerebral cortex, The journal of biological 
chemistry, 273(39):25458-25465) was used as the template in the following PCR reaction. 

35 The reagents were added in the following order in triplicate to a 96 well PCR plate: 

/// 

J Ox Pfc Amplification buffer 5 

JOmMdNTPs 1.5 
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50mM MgS0 4 
15uMJB197 



1 



1.5 



15uMJB198 



1.5 



1 OOng/ ul pcDNA3 . 1 -humans-ot25-2 1 



5 lOxPCR Enhancer 



5 



H 2 0 



32.7 



2.5 UNITS/ JUL Pfx polymerase 



0.8 juL 



The plate was the cycled on an MJ Tetrad DNA engine according to the following cycling 
10 conditions: 

94°C / 2mins 
followed by: 

for 30 cycles 94°C/45sec 
15 58°C/45sec 



followed by: 
68°C / lOmins 
followed by: 
20 hold at 4°C 

The 3366bp product was then gel purified from a 1% TAE agarose gel using QIAEX beads 
and eluted in approximately 50j_il TE. 



Example 2 

Cloning of the PCR fragments of Example 1 into the Baculovirus transfer vector 
pFastBacl 

30 The PCR products of Example 1 were cloned into Stu I digested, calf intestinal 

phosphatase dephosphorylated, phenol chloroform extracted and QIAEX gel purified 
pFastBacl (Life Technologies) using the Rapid DNA ligation kit (Roche Diagnostics) 
transforming XL 1 -blue (ct25-lb) E. Coli cells: 



68°C / 4mins 



25 



25 



a) Screening for positive recombinants 

Given that the PCR product was cloned by blunt-end ligation a screen was required to 
select a recombinant with the gene ligated in the positive orientation with respect to the 
5 polyhedrin promoter in pFastBacl. This was achieved by restriction digest of miniprep 
DNA (Qiagen miniprep kit) prepared from colony minicultures and analysis on a 1% TAE 
agarose gel. A positive clone was identified according to the following digest patterns: 

SEQ ID N° 23 in pFastBacl 
10 Eco RI digest performed on miniprep DNA 

Predicted fragments (bp) 
PCR product cloned in a positive orientation 4773 and 3368 
PCR product cloned in a negative orientation 8127 and 14 

15 b) Sequencing analysis of selected clones 

One positive was selected for this clone and used to prepare a plasmid DNA stock of the 
desired construct (QIAGEN maxi kit). Confirmatory sequence reactions were performed 
using the Big Dye terminator sequencing kit and run on an ABI 310 Prism Genetic 
Analyzer. Sequence analysis of both coding strands was performed using a selection of 
20 sequencing oligonucleotide primers. 

Example 3 

Protocol for establishing baculovirus banks for the expression of the ai5-2 deletion 
mutant SEP ID N°23 

25 

Essentially, the protocol used to generate the baculovirus banks is that outlined in the Life 
Technologies Bac-to Bac™ baculovirus expression systems manual. 

a) Transposition of DHlOBacff Coli cells 

30 One ng (5ul) of the recombinant pFastBac-1 construct containing the nucleotide sequence 
encoding the porcine a 2 5-2 deletion mutant of SEQ ID N°23 was added to 100u.l of 
DHlOBac cells thawed on ice. The cells were then mixed gently by tapping the tube then 
incubated on ice for 30minutes before heat shock treatment by incubation in a 42°C water 
bath for 45 seconds. The mixture was then chilled on ice for 2 minutes before the addition 

35 of 900ul of S.O.C. medium. The mixture was then placed in a shaking incubator (200rpm) 
at 37°C for 4hours. The cells were then serially diluted (10 fold dilutions from 10" 1 to 10" 3 ) 
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and lOul of each dilution plated on LB agar plates containing 50ug/ml kanamycin, 7ug/ml 
gentamicin, lOug/ml tetracycline, lOOug/ml Bluo-gal and 40ug/ml IPTG. The plates were 
incubated at 37°C for between 1 and 3 days until discrete colonies of blue and white colour 
were discernible. 

5 

b) Isolation of recombinant DNA 

White colonies (containing the recombinant bacmid) were picked and grown for 24 hours 
(to stationary phase) at 37°C with shaking (200rpm) in 2ml of LB containing 50u,g/ml 
kanamycin, 7u.g/ml gentamicin and lOug/ml tetracycline. 1.5ml of culture was then 

10 transferred to a microfuge tube and centrifuged at 14,000xg for lminute. The supernatant 
was removed and the cells resuspended gently in 0.3ml of 15mM Tris-HCl (pH8.0), lOmM 
EDTA, 100u.g/ml RNase A. 0.3ml of 0.2N NaOH, 1% SDS was then added and the 
mixture mixed gently before incubation at 22°C for 5 minutes. Then 0.3ml of 3M 
Potassium acetate (pH5.5) was added and the sample placed on ice for 10 minutes. After 

1 5 centrifugation at 14,000xg for 10 minutes the supernatant was transferred to a tube 
containing 0.8ml of isopropanol, mixed then placed on ice for 10 minutes before 
centrifugation at 14,000xg for 10 minutes. The supernatant was then discarded and the 
pellet rinsed with 0.5ml of 70% ethanol before centrifugation at 14,000xg for 5 minutes. 
This 70% ethanol rinse was then repeated before removing all of the supernatant and air 

20 drying the pellet for lOminutes at room temperature. The pellet was finally resuspended in 
40ul of TE. 

c) Transfection of sf9 cells with the recombinant bacmid DNA 

A 6-well tissue culture plate was seeded with 0.9xl0 6 sf9 cells (cells at log phase having 
25 grown from a culture passaged at 0.3xl0 6 cells/ml) per 35mm well in 2ml of Sf-900 II 
SFM media containing 50units/ml penicillin and 50ug/ml streptomycin. Cells were left to 
attach at 27°C for 1 hour. Bacmid DNA prepared as described above (5ul) was added to 
200u.l of Sf-900 II SFM media containing 6ul of CELLFECTIN and mixed before 
incubation at room temperature for 45 minutes. The cells were washed once with 2ml of 
30 Sf-900 II SFM media without antibiotics then 0.8ml of Sf-900 II SFM media was added to 
each tube containing the lipid-DNA complex. The wash buffer was removed from the cells 
and the 1ml of diluted lipid-DNA complex overlaid on the cells. The cells were incubated 
for 5hours at 27°C after which time the transfection mixture was removed and 2ml of Sf- 
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900 II SFM media containing 50units/ml penicillin and 50ug/ml streptomycin was added. 
The cells were then incubated for 72 hours. 

After incubation for 72 hours the media was removed from the cells and centrifuged at 
5 500xg for 5 minutes. The supernatant was then transferred to a fresh tube, this was labelled 
as the P0 bank and stored at 4°C in the dark. The PI bank was prepared by passaging sf9 
cells at approx 5xl0 6 cells/ml to 2xl0 6 cells/ml (100ml in a 250ml Erlenmeyer flask) and 
adding 0.5ml of the P0 bank harvested above. The cells were then incubated shaking 
(200rpm) at 27°C for 4 days. Under sterile conditions the culture was centrifuged at 500xg 
10 for 10 minutes and the supernatant 0.2uM filtered (PI bank). The P2 bank was prepared by 
adding 2ml of PI bank per 400ml culture (in 1L Erlenmeyer flasks) passaged as above to 
2xl0 6 cells/ml. The culture was incubated as before for 4 days and the supernatant 
harvested and filtered as described for the PI bank. The supernatant was first pooled then 
aliquoted (10ml) and stored at 4°C. 

15 

Example 4 

Expression of the clt§-2 deletion mutant of SEP ID N°23 

To sf9 cells passaged from ~5xl0 6 cells/ml to 2x106 cells/ml in Sf-900 II SFM media was 
20 added 0.1ml virus per 100ml of cells of the appropriate viral bank (400ml volumes in 1L 
Erlenmeyer flasks). The cells were then cultured for 4-5 days at 27°C with 1 lOrpm 
shaking. Expression of the protein was confirmed by SDS-PAGE and Western blotting 
using an anti penta-His monoclonal antibody (Qiagen) and was detected in the culture 
supernatant and cell lysate. 

25 

Example 5 

Purification of q?5-2deletion mutant of SEP ID N°23 

The-ot 2 5-2 deletion mutant of SEQ ID N°23 was purified from the cell lysate following the 
30 purification strategy outlined below: 

The culture was centrifuged at 6,000xg for 10 minutes and the supernatant removed. The 
weight of the cell pellet was determined before re-suspension in 20mM Tris pH8.0, 
lOOmMKCl, 1% P40-Nonidet (100ml per 20g of wet cells). A protease inhibitor cocktail 
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(Sigma, Cat# P8849), lml/L, was added to the mixture. The solution was then stirred for 
10 minutes before centrifugation for lhour at 30,000xg and 4°C. The supernatant was 
concentrated (30kDa cut off) to approx. ~300ml then centrifuged for lhour at 100,000xg. 

5 Supernatant containing the soluble proteins was diluted 1:3 in lOmM Tris-HCl pH8.0 
(equilibration buffer) and loaded onto a pre-equilibrated Q-Sepharose column (2.5cm i.d. x 
30cm h.) at a flow rate of 900ml/h. After washing with equilibration buffer until a stable 
A280nm baseline had been achieved, protein was eluted with 20mM Tris-HCl pH8.0, 0.5M 
KC1, lOmM Imidazole. 



The eluate was then loaded onto a Ni-NTA (Qiagen) column (2.5cm i.d. x 6cm h.) pre- 
equilibrated in 20mM Tris pH8.0, 0.5M KC1, lOmM Imidazole at a flow rate of 2 ml/min. 
The column was washed successively with buffer A (20mM Tris pH8.0, 0.5M KC1, 20mM 
Imidazole), buffer B (lOOmM Tris-HCl pH8.0, 1M KC1), and buffer A again. Elution was 
15 performed with buffer C (20mM Tris-HCl pH8.0, lOOmM KC1, 0.5M Imidazole). The Ni- 
NTA eluate (~50ml) was concentrated (30kDa cut-off) to ~2ml and applied at lml/min and 
in 0.2ml aliquots, to an FPLC Superdex-200 column equilibrated in lOmMHEPES, pH7.4, 
150mMNaCl. Fractions containing the polypeptide of SEQ ID N°23 were pulled. 



Example 6 

SPA assay of f-Hlgabapentin binding to the secreted soluble humana28-2 subunit of 
SEP ID N°23 

25 The assay is carried out at 21°C. Assay components are added in the following order (all 
reagents are diluted in lOmM HEPES (pH 7.4 at 21°C) to 96-well Optiplates: 

25ul imidazole at various concentrations (diluted from a 1M stock 

pH8.0, see assay details) 
50ul lOmM HEPES pH 7.4 
30 25ul (50mg) SPA beads (Amersham) 



10 



20 



lOOul 



s-a 2 5-2 subunit polypeptide of SEQ ID No 23 (2ul protein 



diluted to lOOul) 



25ul 



radioligand ([ 3 H]gabapentin obtained from example 5 
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Immediately after adding radioligand, the optiplates were loaded in the Packard Top Count 
scintillation counter to follow the binding time course. Imidazole was first used in the 
assay to optimize the specific interaction of the protein's 6His tag with the SPA bead. 
Imidazole itself (up to lOOmM) in the filtration assay has no effect on [ 3 H]gabapentin 
5 binding (n=l). 

Example 7 

Ni Flashplate assay of f - Hlgabapentin binding to secreted soluble human q?5-2 (SEP 
ID N°23> 

10 Assays are carried out at 21°C in a final volume of 250jil in 96-well NEN Ni chelate flash 
plates. Assay components are added in the following order (all reagents were diluted in 
lOmM HEPES (pH 7.4 at 21°C)): 

25ul 1 OmM HEPES pH7.4 

25)0.1 imidazole at various concentrations (diluted from a 1M stock 
15 pH8.0, see assay details) 

75 ul lOmM HEPES pH 7.4 

100|a.l s-a 2 5-2-6His (2ul protein diluted to lOOui) obtained from 
example 5 

25ul radioligand ([ 3 H]gabapentin (65Ci/mmole) 

20 

Immediately after adding the radioligand, flash plates are loaded in the Packard Top Count 
scintillation counter to follow the binding time course. The ' [ 3 H] flash plate' programme 
(cpm) is used to monitor activity. Imidazole is first used in the assay to optimize the 
specific interaction of the protein's 6His tag with the Ni flashplate. 

25 

Example 8 

Ni Flashplate assay of f - HlLeucine binding to secreted soluble human ag5-2-6His 

30 The procedure described in example 7 is repeated, except that [ 3 H]gabapentin is replaced 
by 25 ul (10.1 nM) of [ 3 H]Leucine (141 Ci/mmole). 
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Example 9 

Ni Flashplate assay studying competitive binding of r-Hlgabapentin and (S+V3- 
isobutvl GABA to human q?5-2-6His (SEP ID N°23V 

5 Assays are carried out at 21 °C in a final volume of 250ul in 96-well NEN Ni chelate flash 
plates. Wells are set up for both 'total' and 'non-specific' binding. Specific binding is 
defined as that remaining after subtraction of the average of the 'non-specific binding' 
values from the average of the 'total' binding values. Assay components are added in the 
following order (all reagents were diluted in lOmM HEPES (pH 7.4 at 21°C)): 
10 25jj,1 1 OmM HEPES pH7.4 or 25 ul othe test compound at the 

appropriate concentration in HEPES 
25jal 200 raM imidazole (diluted from a 1M stock pH8.0, see 
assay details) 
Total binding 75 ul 1 OmM HEPES pH 7.4 

15 Non-specific binding 50ul lOmM HEPES pH 7.4 and 25 ul lOOuM (S+)-3-isobutyl 

GABA 

lOOul a 2 5-2-6His (2ul protein* diluted to lOOul) 
25ul radioligand ([ 3 H]gabapentin or [ 3 H]Leucine) 

20 * The source of a 2 5-2-6His is that purified by fplc Superdex-200 gel filtration (see 
example 5) 

Immediately after adding radioligand, flash plates are loaded in the Packard Top Count 
scintillation counter to follow the binding time course. Incubation time before the assay is 
3 hours. The '[ 3 H] flash plate' programme (cpm) is used to monitor activity 
25 Competition studies are compared across the flash-plate and filter binding methodologies 
in order to validate the new assay technology with the established filter binding 
methodology. 
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GraphPad Prism software is used to process competition curve data and determine IC50 and 
hill slope values. Twelve point competition curves with half log dilution steps of test 
compounds are used in the experiments. 



1 



31 

Example 10 

Filter binding assay of r ~ H1gabapentin binding to the recombinant polypeptide of 
SEP ID N°23 

5 Assays were carried out at 21°C in a final volume of 250ul in 96-deep well plates. Assay 
components were (all reagents were diluted in lOmM HEPES (pH 7.4 at 21°C)): 
25\x\ compound to test 

200ul Polypeptide of SEQ ID N°23 (3ul protein diluted to 200ul) 
25ul radioligand ([ 3 H]gabapentin (65Ci/mmole) 

10 

Plates were incubated at room temperature for lh prior to filtering on to 96-well GF/B 
Unifilter plates pre-soaked in 0.3% polyethylenimine. Filters were washed with 3xlml 
50mM Tris-HCl (pH 7.4 at 4°C), and dried over-night. Scintillant (Microscint O, 50ul) was 
added and the plates counted using a Packard Top Count scintillation counter. Specific 
15 binding was -98% of the 'total' value. In [ 3 H]gabapentin saturation studies, the K D (nM) 
obtained was about 10.62. Table 2 below provides the results of the competition studies. 

Example 11 

20 Construction of a nucleotide sequence encoding a soluble secreted mouse q.25-3 
deletion mutant of SEP ID N°25 as follows. 

a) Primer design 

PCR primers were designed to generate the secreted soluble mouse a.25-3 deletion mutant 

25 of SEQ ID N° 25 as follows: 

5' PCR primer: This was designed to engineer in a KOZAK translation initiation 
consensus sequence prior to the coding sequence (Kozak JBC 266 19867-19870) 
3' PCR primer: This was designed to engineer in six histidine residues followed by a stop- 
codon at the desired location in the coding sequence. In addition to the stop codon the 0,28- 

30 3 primers also included an Eco RI restriction site. 

The bold region in each primer sequence denotes the 'tagged' region; addition of 
sequences not present in the template. Primers were custom synthesized by Perkin Elmer 
Applied Biosystems UK to the ABI ready pure grade, supplied lyophilized then 
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resuspended to 15uM in lOmM TE. JB201 and 202 were provided with 5' phosphate 
groups: 

5' Primer JB201 (5'-tcgccaccatggccgggccgggc-3', seq id n°27) 

5 

3' Primer JB202 (5'-tctcagtgatggtgatggtgatgcgatgcacccccacactctc-3', seqidn o 28) 
b) Protocol for PCR mediated 5' Kozak and 3' 6His tagging of mouse a?5-3 

10 The full length mouse ci25-3 gene (Gen Bank Accession number AJO 10949) in the 
pcDNA3 vector as described in Brown, J.P. and Gee,N.S., Cloning and deletion 
mutagenesis of the a28 calcium channel subunit from porcine cerebral cortex, The journal 
of biological chemistry, 273(3 9):25458-25465) was used as the template in the following 
PCR reaction. 



15 
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The reagents were added in the following order in triplicate to a 96 well PCR plate: 





Jul 


10% P 'fx Amplification buffer 


5 


WmMdNTPs 


1.5 


50mM MgS0 4 


1 


15uMJB201 


1.5 


15uM JB202 


1.5 


100ng/jil pcDNA3-mouse-ot25-3 


1 


lOx PCR Enhancer 


5 


H 2 0 


32.7 


2.5 UNITS/ JUL Pfx polymerase 





0.8 JUL 



The plate was the cycled on an MJ Tetrad DNA engine according to the following cycling 
conditions: 

94°C / 2mins 
followed by: 

for 30 cycles 94°C/45sec 
60°C / 45sec 
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68°C / 4mins 

followed by: 
68°C / lOmins 
followed by: 
5 hold at 4°C 

The 3244bp product was then gel purified from a 1% TAE agarose gel using QIAEX beads 
and eluted in approximately 50ul. 

The truncated protein of SEQ ID N°24 was expressed such the procedure of example 2,3 
10 and 4. 
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SEQUENCE LISTING 

1 - human nucleotide sequence 

ATGGCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCC 
5 CCTGGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCC 
GCGCCCGCTGTGGCTGCTGC&TGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGC 
CTACAGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTC 
GACGGCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACC 
GGAACCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGA 

1 0 CATTGAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAG 
AACTTCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATG 
ACGCCAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAA 
GGCCAGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTAT 
TCATACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATG 

1 5 AGCTCAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCAC 
ACTGCTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCG 
TGGCGAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGG 
GGGCCTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCT 
GACCCTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGAC 

20 TATGTGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGG 
TGCAGGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAA 
GGGCACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAAC 
ATCACTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCG 
TGCAGGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTC 

25 CGTGGGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGC 
TACTATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATG 
TGTTGGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGT 
GTATGAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTG 
ACACAGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACG 

3 0 TGGCTCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGT 
GTTTGCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACC 
AACTTCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGG 
AAGAGATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGT 
CAAGTCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATA 

3 5 AGGAGCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAG 
CCAATCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTT 
TGAGTCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCA 
GACAACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACT 
CCAAGCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCA 
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GCTGGTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTC 
GCTGCCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAG 
AGAACCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGT 
CTTCAAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTG 
5 GGCATCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAG 
TGGTGGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAA 
CCGTACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGC 
GAGGTTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGT 
CAAACCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCT 

1 0 GATGCTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCA 
GCCTGTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCA 
CCGTTGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCA 
GCAGCTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAG 
GGGAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCT 

1 5 CGGTAAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGC 
GCAGAGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAG 
TGCGAGGCTGGCCGG 

2 - human nucleotide sequence 

20 ATGGCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCC 
CCTGGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCC 
GCGCCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGCC 
TACAGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTCG 
ACGGCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACCG 

25 GAACCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGAC 
ATTGAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAGA 
ACTTCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATGA 
CGCCAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAAG 
GCCAGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTATT 

3 0 CATACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATGA 
GCTCAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCACA 
CTGCTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCGT 
GGCGAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGGG 
GGCCTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCTG 

3 5 ACCCTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGACT 
ATGTGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGGT 
GCAGGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAAG 
GGCACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAACA 
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TCACTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCGT 
GCAGGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTCC 
GTGGGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGCT 
ACTATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATGT 
5 GTTGGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGTG 
TATGAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTGA 
CACAGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACGT 
GGCTCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGTG 
TTTGCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACCA 

1 0 ACTTCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGGA 
AGAGATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGTC 
AAGTCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATAA 
GGAGCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAGC 
CAATCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTTT 

1 5 GAGTCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCAG 
ACAACAACAC CGAGTTC CTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACTC 
CAAGCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCAG 
CTGGTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTCG 
CTGC C AC AGACGGTGGC ATC AC C CGAGTCTTC C C C AAC AAGGC AGCTGAGGACTGGACAGA 

20 GAACCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGTC 
TTCAAGCCCCCAC AC C AGGATGC C CTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTGG 
GCATCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAGT 
GGTGGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAAC 
CGTAC C C AC CAAGAC C AGC CTC AGAAGTGCGGC C C C AACAGC CACTGTGAGATGGACTGCG 

25 AGGTTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGTC 
AAACCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCTG 
ATGCTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCAG 
CCTGTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCAC 
CGTTGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCAG 

30 CAGCTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAGG 
GGAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTC 
GGTAAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGCG 
CAGAGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGT 
GCGAGGCTGGCCGGCTGCTGCAGAAGGAGACGCACTGCCCAGCGGACGGCCCGGAGCAGTG 

35 TGAGCTAGTGCAGAG 
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3 - human nucleotide sequence 

ATGGCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCC 
CCTGGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCC 
GCGCCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGCC 
5 TACAGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTCG 
ACGGCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACCG 
GAACCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGAC 
ATTGAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAGA 
ACTTCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATGA 

1 0 CGCCAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAAG 
GCCAGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTATT 
CATACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATGA 
GCTCAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCACA 
CTGCTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCGT 

1 5 GGCGAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGGG 
GGCCTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCTG 
ACCCTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGACT 
ATGTGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGGT 
GCAGGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAAG 

20 GGC AC C AC AGGCTACAAGGC CGGCTTTGAGTATGC CTTTGAC C AGCTGCAGAACTC CAAC A 
TCACTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCGT 
GCAGGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTCC 
GTGGGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGCT 
ACTATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATGT 

25 GTTGGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGTG 
TATGAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTGA 
CACAGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACGT 
GGCTCTGAATGAC ATCAAGAGGCTGAC C C C CAACTACACGCTTGGAGC C AACGG CTATGTG 
TTTGCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACCA 

30 ACTTCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGGA 
AGAGATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGTC 
AAGTCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATAA 
GGAGCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAGC 
CAATCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTTT 

3 5 GAGTCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCAG 
ACAACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACTC 
CAAGCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCAG 
CTGGTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTCG 
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CTGCCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAGA 
GAACCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGTC 
TTCAAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTGG 
GCATCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAGT 
5 GGTGGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAAC 
CGTACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGCG 
AGGTTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGTC 
AAACCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCTG 
ATGCTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCAG 

10 CCTGTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCAC 
CGTTGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCAG 
CAGCTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAGG 
GGAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTC 
GGTAAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGCG 

1 5 CAGAGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGT 
GCGAGGCTGGCCGGCTGCTGCAGAAGGAGACGCACTGCCCAGCGGACGGCCCGGAGCAGTG 
TGAGCTAGTGCAGAGACCGCGATACCGGAGAGGCCCGCACATCTGCTTCGACTACAACGCG 
ACAGAAGATACCTCAGACTGTGGCCGCGGGGCC 

20 4 - human amino acid sequence 

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YS FPQQHTMQHWARRLEQEVDGVMRI FGGVQQLRE I YKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 
ASTLRLDFIEDPNFKNKWYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT 

25 LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVI IVDVSGSVSGL 
TLKLMKTSVCEMLDTLSDDDYWVASFNEKA.QPVSCFTHLVQA1WRNKKVFKEAVQGMVAK 
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS 
VGQHNYDVTPLQWMACANKGYY FE I PS IGAI R INTQE YLDVLGRPMVLAGKEAKQVQWTNV 
YEDALGLGLWTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV 

30 FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELiyiEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV 
FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAWGVKLDLEAWAEKFKVLASN 

35 RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 
MLALYISnsrSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ 
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA 
QRLTNTNLL F WAE K P L C S Q C E AGR 
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5 - human amino acid sequence 

mVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YS FPQQHTMQHWARRLEQEVDGVMRI FGGVQQLRE I YKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 
5 ASTLRLDFIEDPNFKJSFKWYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT 
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 
TLKLMKTSVCEMLDTLSDDDYV3WASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK 
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS 
VGQHNYDVTPLQWMACANKGYYFE I PS IGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV 

10 YEDALGLGLWTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV 
FAIDLNGWLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
ESEGHVFIAPREYCKDLNASDKNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV 

1 5 FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAWGVKLDLEAWAEKFKVLASN 
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ 
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA 
QRLTNTNLLFWAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQ 

20 

6 - human amino acid sequence 

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREI YKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 

25 ASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT 
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK 
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS 
VGQHNYDVTPLQWMACANKGYYFE I PS IGAIRINTQE YLDVLGRPMVLAGKEAKQVQWTNV 

3 0 YEDALGLGL WTGTL PVFNLTQDGPGEKKNQL I LGVMG IDVALND IKRLTPNYTLGANG YV 
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 
KSLDERYIDEVTRNYTWPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV 

35 FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAWGVKLDLEAWAEKFKVLASN 
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ 
QLLYGL I YHSWFQAD PAEAEGS PETRESS CVMKQTQYYFGSVNAS YNAI IDCGNCS RL FHA 
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QRLTNTNLLFWAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRRGPHICFDYNA 
TEDTSDCGRGA 

7 - human nucleotide sequence 

5 ATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGGCCTCGGCGCTTCTCGCTGCCG 
CGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGAGCAGCAGATACCGCTCTCCGT 
GGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAATCCATTGCTGCTAAGTACTCC 
GGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGAAAGACGTTGCCATAGAAGAAA 
TTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACATGGAAGAGATGTTTCACAAGAA 

10 GTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAAGCACACCTGAAACATGAATTT 
GATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGATAAATGAAAGGGACAAAGACG 
GGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCCAAATGACCATTTTAATAATTT 
GCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACGAACATGTACAACAAAGACCCT 
GCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACAAAGTTTTTGTAGATAACTTTG 

1 5 ACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGCAAAGGGCTTTTTTAGGCAGTA 
TCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATTGCCTTCGACTGCAGGAACCGA 
AAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGGTCATTTTAGTTGACGTCAGTG 
GCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAACAGTCTCATCCATTTTGGATAC 
ACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAATGAGGAGCTTCACTATGTGGAA 

20 CCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAAACAAAGAGCACTTCAGGGAGC 
ATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGATATAGCTCTGAATGAGGCCTT 
CAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGTATCTGCAGTCAGGCCATCATG 
CTCATAACTGATGGGGCGGTGGACACCTATGATACAATCTTTGCAAAATACAATTGG C CAG 
ATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGAGGCTGCGTTTGCAGACAATCT 

25 AAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAGATCTCCACCTTGGCTGATGTG 
CAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGCCCAAAGTCATCGACCAGGAGC 
ATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCTGACTGATGATCAGGGCCCCGT 
CCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAGAACGAAACCAGATCGAAGGGC 
ATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAGAACTTCTGAAGACCATCCCCA 

30 AATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCACAAATAATGGRTATATCCTGAC 
GCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAGCGAAGGAAACCTAACTATAGT 
AGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATGACGTGTTGAGAAATGCTATGG 
TGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAAGACAGTGGACAAAGGGAAACG 
GGTTTTGGTGATGACAAATGACTACTATTATACAGACATCAAGGGTACTCCTTTCAGTTTA 

3 5 GGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCCGAGGGAATGTAACCATCGAAG 
AAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGCAGATGAATGGTCCTACTGCAA 
CACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTAGAAGCGATTAAGCTCTACCTA 
AAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGATCCAAGAAGTCCTTTTTGACG 
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CGGTGGTGAGTGC C C C CATTGAAGCGTATTGGAC CAGC CTGGCCCTCAACAAATCTGAAAA 
TTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGCACGGGCCTCTCCAGAATCAAC 
CTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCCTGAAAGCTGGCGACAAGGAGA 

5 GAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTCAATAAAAGCAATGTGGTGACA 
GCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTCCTGTGGTGGCAGCTGTAGGCA 
TTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGACTGCCAGCAGACAGTGTGCTTC 
CCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACTGTGAATTGTTACCTCATAGAC 
AATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGACTGGAGACTTTTTTGGTGAGA 

1 0 TCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTCCTTTAAAAGAATTACCCTTTA 
TGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGCGATGGCGCCCATGGCCTCCTG 
GATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCATGACAGAACTTGTCTTGTTCC 
TGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATATGACAGCTAAAGCCCAGAAATT 
GAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCATTCGTCTCTGAGCGCACCATC 

1 5 AAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCAAGTCCTTTGTCATCCAGCAAA 
TCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAGCTGCCTCTGTGAATCTGTGGC 
CCCCATC 

8 - human nucleotide sequence 

20 ATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGGCCTCGGCGCTTCTCGCTGCCG 
CGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGAGCAGCAGATACCGCTCTCCGT 
GGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAATCCATTGCTGCTAAGTACTCC 
GGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGAAAGACGTTGCCATAGAAGAAA 
TTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACATGGAAGAGATGTTTCACAAGAA 

2 5 GTCTGAGGC CGTCAGGCGTCTGGTGGAGGCTGCAGAAGAAGCACACCTGAAACATGAATTT 

GATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGATAAATGAAAGGGACAAAGACG 
GGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCCAAATGACCATTTTAATAATTT 
GCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACGAACATGTACAACAAAGACCCT 
GCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACAAAGTTTTTGTAGATAACTTTG 
30 ACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGCAAAGGGCTTTTTTAGGCAGTA 
TCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATTGCCTTCGACTGCAGGAACCGA 
AAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGGTCATTTTAGTTGACGTCAGTG 
GCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAACAGTCTCATCCATTTTGGATAC 
ACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAATGAGGAGCTTCACTATGTGGAA 

3 5 CCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAAACAAAGAGCACTTCAGGGAGC 

ATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGATATAGCTCTGAATGAGGCCTT 
CAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGTATCTGCAGTCAGGCCATCATG 
CTCATAACTGATGGGGCGGTGGACACCTATGATACAATCTTTGCAAAATACAATTGGCCAG 
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ATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGAGGCTGCGTTTGCAGACAATCT 
AAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAGATCTCCACCTTGGCTGATGTG 
CAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGCCCAAAGTCATCGACCAGGAGC 
ATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCTGACTGATGATCAGGGCCCCGT 
5 CCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAGAACGAAACCAGATCGAAGGGC 
ATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAGAACTTCTGAAGACCATCCCCA 
AATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCACAAATAATGGRTATATCCTGAC 
GCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAGCGAAGGAAACCTAACTATAGT 
AGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATGACGTGTTGAGAAATGCTATGG 

1 0 TGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAAGACAGTGGACAAAGGGAAACG 
GGTTTTGGTGATGACAAATGACTACTATTATACAGACATCAAGGGTACTCCTTTCAGTTTA 
GGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCCGAGGGAATGTAACCATCGAAG 
AAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGCAGATGAATGGTCCTACTGCAA 
CACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTAGAAGCGATTAAGCTCTACCTA 

1 5 AAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGATCCAAGAAGTCCTTTTTGACG 
CGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCTGGCCCTCAACAAATCTGAAAA 
TTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGCACGGGCCTCTCCAGAATCAAC 
CTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCCTGAAAGCTGGCGACAAGGAGA 
ACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAGAGCCGCTGAGCAGATTCCAGG 

20 GAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTCAATAAAAGCAATGTGGTGACA 
GCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTCCTGTGGTGGCAGCTGTAGGCA 
TTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGACTGCCAGCAGACAGTGTGCTTC 
CCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACTGTGAATTGTTACCTCATAGAC 
AATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGACTGGAGACTTTTTTGGTGAGA 

25 TCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTCCTTTAAAAGAATTACCCTTTA 
TGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGCGATGGCGCCCATGGCCTCCTG 
GATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCATGACAGAACTTGTCTTGTTCC 
TGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATATGACAGCTAAAGCCCAGAAATT 
GAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCATTCGTCTCTGAGCGCACCATC 

3 0 AAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCAAGTCCTTTGTCATCCAGCAAA 
TCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAGCTGCCTCTGTGAATCTGTGGC 
C C C CATC ACC ATGGC AC C C ATTGAAAT CAGGTATAATGAATC C CTTAAGTGTGAACGTCTA 
AAG 

35 9 - human nucleotide sequence 

ATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGGCCTCGGCGCTTCTCGCTGCCG 
CGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGAGCAGCAGATACCGCTCTCCGT 
GGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAATCCATTGCTGCTAAGTACTCC 
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GGTTC CCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGAAAGACGTTGC CATAGAAGAAA 
TTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACATGGAAGAGATGTTTCACAAGAA 
GTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAAGCACACCTGAAACATGAATTT 
GATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGATAAATGAAAGGGACAAAGACG 
5 GGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCCAAATGACCATTTTAATAATTT 
GC CTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACGAACATGTAC AACAAAGAC C CT 
GCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACAAAGTTTTTGTAGATAACTTTG 
ACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGCAAAGGGCTTTTTTAGGCAGTA 
TCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATTGCCTTCGACTGCAGGAACCGA 

1 0 AAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGGTCATTTTAGTTGACGTCAGTG 
GCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAACAGTCTCATCCATTTTGGATAC 
ACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAATGAGGAGCTTCACTATGTGGAA 
CCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAAACAAAGAGCACTTCAGGGAGC 
ATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGATATAGCTCTGAATGAGGCCTT 

1 5 CAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGTATCTGCAGTCAGGCCATCATG 
CTCATAACTGATGGGGCGGTGGACACCTATGATACAATCTTTGCAAAATACAATTGGCCAG 
ATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGAGGCTGCGTTTGCAGACAATCT 
AAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAGATCTCCACCTTGGCTGATGTG 
CAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGCCCAAAGTCATCGACCAGGAGC 

20 ATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCTGACTGATGATCAGGGCCCCGT 
CCTGATGACCACTGTAGC C ATGC CTGTGTTTAGTAAGCAGAACGAAACCAGATCGAAGGGC 
ATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAGAACTTCTGAAGACCATCCCCA 
AATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCACAAATAATGGRTATATCCTGAC 
GCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAGCGAAGGAAACCTAACTATAGT 

25 AGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATGACGTGTTGAGAAATGCTATGG 
TGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAAGACAGTGGACAAAGGGAAACG 
GGTTTTGGTGATGACAAATGACTACTATTATACAGACATCAAGGGTACTCCTTTCAGTTTA 
GGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCCGAGGGAATGTAACCATCGAAG 
AAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGCAGATGAATGGTCCTACTGCAA 

3 0 CACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTAGAAGCGATTAAGCTCTACCTA 
AAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGATCCAAGAAGTCCTTTTTGACG 
CGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCTGGCCCTCAACAAATCTGAAAA 
TTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGCACGGGCCTCTCCAGAATCAAC 
CTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCCTGAAAGCTGGCGACAAGGAGA 

3 5 ACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAGAGCCGCTGAGCAGATTCCAGG 
GAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTCAATAAAAGCAATGTGGTGACA 
GCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTCCTGTGGTGGCAGCTGTAGGCA 
TTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGACTGCCAGCAGACAGTGTGCTTC 
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CCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACTGTGAATTGTTACCTCATAGAC 
AATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGACTGGAGACTTTTTTGGTGAGA 
TCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTCCTTTAAAAGAATTACCCTTTA 
TGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGCGATGGCGCCCATGGCCTCCTG 
5 GATC CTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCATGACAGAACTTGTCTTGTTCC 
TGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATATGACAGCTAAAGCCCAGAAATT 
GAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCATTCGTCTCTGAGCGCACCATC 
AAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCAAGTCCTTTGTCATCCAGCAAA 
TCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAGCTGCCTCTGTGAATCTGTGGC 
1 0 CCCCATCACCATGGCACCCATTGAAATCAGGTATAATGAATCCCTTAAGTGTGAACGTCTA 
AAGGCCCAGAAGATCAGAAGGGCCCAGAAGATCAGAAGGCGCCCAGAATCTTGTCATGGCT 
TCCATCCTGAGGAGAATGCAAGGGAGTGTGGGGGTGCGCCG 

10 - human amino acid sequence 

15 MAGPGSPRRASRGASALLAAALLYAALGDWRSEQQIPLSWKLWASAFGGEIKSIAAKYS 
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF 
DADLQYEYFNAVLINERDKIDGNFLELGKEFILAPISroH^ 

AI VNGVYWS E S LNKVFVDNFDRDPS L IWQ YFGS AKGFFRQ YPGI KWE PDENGVI AFDCRNR 
KWYIQAATSPKDWILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE 

20 PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGS ICSQAIM 
L I TDGAVDT YDT I F AK YNWPDRKVR I FT YL IGRE AAFADNLKWMAC ANKGF FTQ I S TLADV 
QE1WMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG 
ILLGWGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS 
SVDLSEVEWEDRDDVLRNA1WNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDIKGTPFSL 

25 GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL 
KGKEPLLQCDKELiIQEVLFDAWSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN 
LFVGAEQLTNQDFLiKAGDKENIFNADHFPLWYRRAAEQIPGSFVYSIPFSTGPVNKSNWT 
ASTS IQLLDERKS PWAAVGIQMKLEFFQRKFWTASRQCASLDGKCS ISCDDETVNCYLID 
NNGF I LVS ED YTQTGD F FGE I EGAVMNKLLTMGS FKR I TL YD YQ AMCRANKE S SDGAHGLL 

30 DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDiyiTAKAQKLKQTLEPCDTEYPAFVSERTI 
KETTGNIACEDCSKSFVIQQIPSSNLFMWVDSSCLCESVAPI 

11 - human amino acid sequence 

MAGPGSPRRASRGASALLAAALLYAALGDWRS EQQ I PLS WKLWASAFGGE IKS IAAKYS 
3 5 GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF 
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP 
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR 
KWYIQAATSPKDWILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE 
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PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM 
LITDGAVDTYDTI FAKYNWPDRKVRI FTYLIGREAA.FADNLKWMACANKGFFTQ I STLADV 
QEISTVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG 
ILLGWGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS 
5 S VDLS EVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDI KGTPFS L 
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL 
KGKEPLLQCDKELIQEVLFDAWSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN 
LFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIPGSFVYSIPFSTGPVNKSNWT 
ASTS IQLLDERKS PWAAVGIQMKLEFFQRKFWTASRQCAS LDGKCS I S CDDETVNC YL ID 
10 NNGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSDGAHGLL 
DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI 
KETTGNIACEDCSKSFVIQQIPSSNLFMVVVDSSCLCESVAPITMAPIEIRYNESLKCERL 
K 

15 12 - human amino acid sequence 

MAGPGS PRRAS RGAS ALLAAALL YAALGD WRS EQQ I PLS WKLWAS AFGGE IKS IAAKYS 
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF 
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP 
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR 

20 KWYIQAATSPKDWILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE 
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM 
LITDGAVDTYDTIFAKY3SWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV 
QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG 
ILLGWGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS 

25 S VDLS EVEWEDRDDVLRNAMVNRKTGK F SMEVKKTVDKGKRVLVMTND Y Y YTD I KGT P FS L 
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL 
KGKEPLLQCDKELIQEVLFDAWS AP I EAYWTS LALNKS ENSDKGVEVAFLGTRTGLSRIN 
LFVGAEQLTNQDFLKAGDKENI FNADHF PLWYRRAAEQI PGS FVYS I PFSTGPVNKSNWT 
ASTS IQLLDERKS PVVAAVGIQMKLEFFQRKFWTASRQCAS LDGKCS ISCDDETVNCYLID 

30 MSJGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSDGAHGLL 
DPYNAFLSAVKWINTTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI 
KETTGNIACEDCSKSFVIQQIPSSNLFMVWDSSCLCESVAPITMAPIEIRYNESLKCERL 
KAQKIRRRPESCHGFHPEENARECGGAP 

35 13 - human nucleotide sequence 

AGTGGCCTCCTGAGAAGCAGCTTGTTCGTGGGCTCCGAGAAGGTCTCCGACAGGAAGTTCC 
TGACACCTGAGGACGAGGCCAGCGTGTTCACCCTGGACCGCTTCCCGCTGTGGTACCGCCA 
GGCCTCAGAGCATCCTGCTGGCAGCTTCGTCTTCAACCTCCGCTGGGCAGAAGGACCAGAA 
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AGTGCGGGTGAACCCATGGTGGTGACGGCAAGCACAGCTGTGGCGGTGACCGTGGACAAGA 
GGACAGCCATTGCTGCAGCCGCGGGCGTCCAAATGAAGCTGGAATTCCTCCAGCGCAAATT 
CTGGGCGGCAACGCGGCAGTGCAGCACTGTGGATGGGCCGTGCACACAGAGCTGCGAGGAC 
AGTGATCTGGACTGCTTCGTCATCGACAACAACGGGTTCATTCTGATCTCCAAGAGGTCCC 
5 GAGAGACGGGAAGATTTCTGGGGGAGGTGGATGGTGCTGTC CTGAC C CAGCTGCTCAGCAT 
GGGGGTGTTCAGCCAAGTGACTATGTATGACTATCAGGC C ATGTGCAAAC C CTCGAGTCAC 
CACCACAGTGCAGCCCAGCCCCTGGTCAGCCCAATTTCTGCCTTCTTGACGGCGACCAGGT 
GGCTGCTGCAGGAGCTGGTGCTGTTCCTGCTGGAGTGGAGTGTCTGGGGCTCCTGGTACGA 
CAGAGGGGCCGAGGCCAAAAGTGTCTTCCATCACTCCCACAAACACAAGAAGCAGGACCCG 
10 CTGCAGCCCTGCGACACGGAGTACCCCGTGTTCGTGTACCAGCCGGCCATCCGGGAGGCCA 
ACGGGATCGTGGAGTGCGGGCCCTGCCAGAAGGTATTTGTGGTGCAGCAGATTCCCAACAG 
TAACCTCCTCCTCCTGGTGACAGACCCCACCTGTGACTGCAGCATCTTCCCACCAGTG 

14 - human nucleotide sequence 

1 5 AGTGGCCTCCTGAGAAGCAGCTTGTTCGTGGGCTCCGAGAAGGTCTCCGACAGGAAGTTCC 
TGACACCTGAGGACGAGGCCAGCGTGTTCACCCTGGACCGCTTCCCGCTGTGGTACCGCCA 
GGCCTCAGAGCATCCTGCTGGCAGCTTCGTCTTCAACCTCCGCTGGGCAGAAGGACCAGAA 
AGTGCGGGTGAACCCATGGTGGTGACGGCAAGCACAGCTGTGGCGGTGACCGTGGACAAGA 
GGACAGCCATTGCTGCAGCCGCGGGCGTCCAAATGAAGCTGGAATTCCTCCAGCGCAAATT 

20 CTGGGCGGCAACGCGGCAGTGCAGCACTGTGGATGGGCCGTGCACACAGAGCTGCGAGGAC 
AGTGATCTGGACTGCTTCGTCATCGACAACAACGGGTTCATTCTGATCTCCAAGAGGTCCC 
GAGAGACGGGAAGATTTCTGGGGGAGGTGGATGGTGCTGTCCTGACCCAGCTGCTCAGCAT 
GGGGGTGTTCAGCCAAGTGACTATGTATGACTATCAGGCCATGTGCAAACCCTCGAGTCAC 
CACCACAGTGCAGCCCAGCCCCTGGTCAGCCCAATTTCTGCCTTCTTGACGGCGACCAGGT 

25 GGCTGCTGCAGGAGCTGGTGCTGTTCCTGCTGGAGTGGAGTGTCTGGGGCTCCTGGTACGA 
CAGAGGGGCCGAGGCCAAAAGTGTCTTCCATCACTCCCACAAACACAAGAAGCAGGACCCG 
CTGCAGCCCTGCGACACGGAGTACCCCGTGTTCGTGTACCAGCCGGCCATCCGGGAGGCCA 
ACGGGATCGTGGAGTGCGGGCCCTGCCAGAAGGTATTTGTGGTGCAGCAGATTCCCAACAG 
TAACCTCCTCCTCCTGGTGACAGACCCCACCTGTGACTGCAGCATCTTCCCACCAGTGCTG 

3 0 CAGGAGGCGACAGAAGTCAAATATAATGCCTCTGTCAAATGTGACCGGATGCGC 

15 - human nucleotide sequence 

AGTGGCCTCCTGAGAAGCAGCTTGTTCGTGGGCTCCGAGAAGGTCTCCGACAGGAAGTTCC 
TGACACCTGAGGACGAGGCCAGCGTGTTCACCCTGGACCGCTTCCCGCTGTGGTACCGCCA 
3 5 GGCCTCAGAGCATCCTGCTGGCAGCTTCGTCTTCAACCTCCGCTGGGCAGAAGGACCAGAA 
AGTGCGGGTGAACCCATGGTGGTGACGGCAAGCACAGCTGTGGCGGTGACCGTGGACAAGA 
GGACAGCCATTGCTGCAGCCGCGGGCGTCCAAATGAAGCTGGAATTCCTCCAGCGCAAATT 
CTGGGCGGCAACGCGG C AGTG C AG CACTGTGGATGGGCCGTGCACAC AGAGCTGCGAGGAC 
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AGTGATCTGGACTGCTTCGTCATCGACAACAACGGGTTCATTCTGATCTCCAAGAGGTCCC 
GAGAGACGGGAAGATTTCTGGGGGAGGTGGATGGTGCTGTCCTGACCCAGCTGCTCAGCAT 
GGGGGTGTTCAGCCAAGTGACTATGTATGACTATCAGGCCATGTGCAAACCCTCGAGTCAC 
CACCACAGTGCAGCCCAGCCCCTGGTCAGCCCAATTTCTGCCTTCTTGACGGCGACCAGGT 
5 GGCTGCTGCAGGAGCTGGTGCTGTTCCTGCTGGAGTGGAGTGTCTGGGGCTCCTGGTACGA 
CAGAGGGGCCGAGGCCAAAAGTGTCTTCCATCACTCCCACAAACACAAGAAGCAGGACCCG 
CTGCAGCCCTGCGACACGGAGTACCCCGTGTTCGTGTAC C AGCCGGC CATC CGGGAGGCCA 
ACGGGATCGTGGAGTGCGGGCCCTGCCAGAAGGTATTTGTGGTGCAGCAGATTCCCAACAG 
TAACCTCCTCCTCCTGGTGACAGACCCCACCTGTGACTGCAGCATCTTCCCACCAGTGCTG 
10 CAGGAGGCGACAGAAGTCAAATATAATGCCTCTGTCAAATGTGACCGGATGCGCTCCCAGA 
AGCTCCGCCGGCGACCAGACTCCTGCCACGCCTTCCATCCAGAGGAGAATGCCCAGGACTG 
CGGCGGCGCCTCG 

16 - human amino acid sequence 

15 SGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAGSFVFNLRWAEGPE 
SAGEPMWTASTAVAVTVDKRTAIAAAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCED 
SDLDCFVIDNNGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSQVTMYDYQAMCKPSSH 
HHSAAQPLVSPISAFLTATRWLLQELVLFLLEWSVWGSWYDRGAEAKSVFHHSHKHKKQDP 
LQPCDTEYPVFVYQPAIREANGIVECGPCQKVFWQQIPNSNLLLLVTDPTCDCSIFPPV 

20 

17 - human amino acid sequence 

SGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAGSFVFNLRWAEGPE 
SAGEPMWTASTAVAVTVDKRTAIAAAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCED 
SDLDCFVIDJSnSTGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSQVTMYDYQAMCKPSSH 
25 HHSAAQPLVSPISAFLTATRWLLQELVLFLLEWSVWGSWYDRGAEAKSVFHHSHKHKKQDP 
LQPCDTEYPVFVYQPAIREANGIVECGPCQKVFWQQIPNSNLLLLVTDPTCDCSIFPPVL 
Q EATE VK YNAS VKCDRMR 

18 - human amino acid sequence 

30 SGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAGSFVFNLRWAEGPE 
SAGEPMWTASTAVAVTVDKRTAIAAAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCED 
SDLDCFVIDNNGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSQVTMYDYQAMCKPSSH 
HHSAAQPLVSPISAFLTATRWLLQELVLFLLEWSVWGSWYDRGAEAKSVFHHSHKHKKQDP 
LQPCDTEYPVFVYQPAIREANGIVECGPCQKVFWQQIPNSNLLLLVTDPTCDCSIFPPVL 

35 QEATEVKYNASVKCDRMRSQKLRRRPDSCHAFHPEENAQDCGGAS 



51 



19 - human alpha2 delta-2 nucleotide sequence 

CGGGCAGCGCAGCCCGCAGAGGCGCTGCGGCCCGTGCAGCCCCGGAGGCCCCTCGCGGAGA 
AGGCGGCGGCGGAGGAGAGGCCGAGTTACCGCCCGCCGCCCGCGCCCCCCCAACCCCGCCG 
CCGCCGCCGCCGCCGCCACTGCCCCCCCTCCCCGCGGCGCCGCATCTTGAATGGAAACATG 
5 GCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCCCCT 
GGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCCGCG 
CCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGCCTAC 
AGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTCGACG 
GCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACCGGAA 

10 CCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGACATT 
GAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAGAACT 
TCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATGACGC 
CAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAAGGCC 
AGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTATTCAT 

1 5 ACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATGAGCT 
CAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCACACTG 
CTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCGTGGC 
GAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGGGGGC 
CTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCTGACC 

20 CTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGACTATG 
TGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGGTGCA 
GGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAAGGGC 
ACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAACATCA 
CTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCGTGCA 

25 GGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTCCGTG 
GGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGCTACT 
ATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATGTGTT 
GGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGTGTAT 
GAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTGACAC 

3 0 AGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACGTGGC 
TCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGTGTTT 
GCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACCAACT 
TCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGGAAGA 
GATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGTCAAG 

3 5 TCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATAAGGA 
GCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAGCCAA 
TCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTTTGAG 
TCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCAGACA 
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ACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACTCCAA 
GCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCAGCTG 
GTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTCGCTG 
CCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAGAGAA 
5 CCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGTCTTC 
AAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTGGGCA 
TCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAGTGGT 
GGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAACCGT 
ACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGCGAGG 

1 0 TTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGTCAAA 
CCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCTGATG 
CTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCAGCCT 
GTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCACCGT 
TGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCAGCAG 

15 CTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAGGGGA 
GCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTCGGT 
AAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGCGCAG 
AGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGTGCG 
AGGCTGGC CGGCTGCTGC AGAAGGAGACG CACTGCCCAGCGGACGGC C CGGAGC AGTGTGA 

20 GCTAGTGCAGAGACCGCGATACCGGAGAGGCCCGCACATCTGCTTCGACTACAACGCGACA 
GAAGATACCTCAGACTGTGGCCGCGGGGCCTCCTTCCCGCCGTCGCTGGGCGTCCTGGTCT 
CCCTGCAACTGCTGCTCCTCCTGGGCCTGCCGCCCCGGCCGCAGCCTCAAGTCCTCGTCCA 
CGCCTCTCGCCGCCTCTGAGCACCCTGCCCCACCCCACCTCCACTCCCACCTCACCCGGCC 
TCTTCGCCTTTCCCACCCTCCTGCCCCACACTCCCCGCCTTAGAGCCTCGTCCCTCCCTCA 

25 CTGAAGGACCTGAGCTGGCCAGGCCCTGAGAGTCTGGTCTGCGCCTTGGGATGGGGAGTCC 
CAAAGCGGGACGCCGCAGGTGTTTGGCACCCAAATCACATCTCACCTCCGAACTGTTCAAG 
TGTCCCCAGACCCTTCTTGCCTGCTGGGCTC C C C C C AGTGGGATGGGACAGGGAGGCCACA 
CGCACTGGTGCCAAAACCAGGCCTCTGCTGCCGCCCTTCCTGGAGGCTGCCTATGTTGGGG 
GGGACCCTGCCTCAGCTGACCCGGCCTCTCTGCCCCACCCAAGCCCAAACTTGGTTTCTGT 

3 0 GAGAATAGTGGAGGAAGGTGAGATGGCCAGTTTGAAGCCTGTGCCTCCCAGCTTAAATCCT 
AGCAGGAGAGAGGCTCTGGGGCAGCCCCCATGGGCTCCTGCCCCTTTCAGGCCTACAGCCA 
CATCCCCAAGCCCACCAGGTGTCAGGATAGTCACAGTGATACCAGTTCAGACACTACCCCA 
TATACACCTGGAACATTGAGGATGGAAACTGGACTCACATTCGACATACCCCACTGGGCAC 
ACGCACAAACACACACACTATGGGGTGGGGTGGGTGTAGGGGCTTACAAAGCCTTACACAG 

35 GGCGAGGGGTTGGTGGGAGGGTTGGCACCTGCACACTCCATCTCCTGCTCACCACCTGCCT 
CTAATCTGAGCTGCAGCCTGGCTGGTCCTCCCATTTCTAAAGCTGAATGTCAAACAGTGCC 
AAATGCTGGGGCAGGGGGTGAAGAACCCTCTGTCCCACCCCTAGCCACCAGTGTCCTCCAA 
GTGCCCCCTCACCTCTCCAGGTGCTCATTGTAACCATTTCTCACTAGTGTCAGGCCCCCAG 
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TGGGACCACATGCCACTGCCTGCACCTTTCGGCAGAGGAACCCCCACCAGACATCACCCTT 
TGCCTTAGCAGGGGTGACTTTGTCTCTCCTGGCTGGGCCATCCTTCCGCCAATCTGGCCCT 
TACACACTCAGGCCTGTGCCCACTCCCTATCTCCTTCCCACCCCTACACACACACTCCCTG 
CTTGCAGGAGGCCAAACTGTCCCTCCCTTGCTGAACACACACACACACACACACACACAGG 
5 TGGGGACTGGGCACAGCTCTTCACACCATTCATTCTGGTCATTTC C C C CAAAGGCAT C C CA 
GCCTGGGGGCCAGTGGGGAACTGAGGGCAAGGGGATATAGTGATGGGGCTCAGATGGACTG 
GGAGGAGGGGGAGGGTGATGCATTAATTAATGGCTTCGTTAATTAATGTCATGTTGCTTGT 
CGCTTTCTCAGTGTGTGTGTGTGGTCCATGCCCACTGCTGGTGCCAGGGTGGGTGTCCATG 
TGCACCCGGCCTGGATGCCAGCTGTGTCCTTCGGGGGCGTGCGTGTAACTGTAGTGTAGTC 

1 0 AGGTGCTCAATGGAGAATATAAACATATACAGAAAAATATATATTTTAAGTTTAAAAAACA 
GAAAAACAGACAAAACAATCCCCATCAGGTAGCTGTCTAACCCCCAGCTGGGTCTAATCCT 
TCTCATTACCCACCCGACCTGGCTGCCCCTCACCTTGGGCTGGGGGACTGGGGGGCCATTT 
C CTTTTCTCTGC C CTTTTTTTGTTGTTCTATTTTGTACAGACAAGTTGGAAAAACAACAGC 
GACAAAAAAGTCAAGAAACTTTGTAAAATATCGTGTGTGTGATTCCTTGTAAAATATTTTC 

1 5 AAATGGTTTATTACAGAAGATCAGTTATTAAATAATGTTCATATTTTCACTTC 

20- human alpha2 delta -2 amino acid sequence 

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD 

20 IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 
ASTLRLDFIEDPNFKNKWYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT 
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK 
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS 

25 VGQHNYDVTPLQWMACANKGYYFEIPS IGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV 
YEDALGLGL WTGTLPVFNLTQDGPGE KKNQ L ILGVMGIDVALND I KRLTPNYTLGANGYV 
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
E S EGHVF I APRE YCKDLNAS DNNTE FL KNF I E LME KVT PDS KQCNNFLLHNL I LDTG I TQQ 

30 LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV 
FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAWGVKLDLEAWAEKFKVLASN 
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ 
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAI IDCGNCSRLFHA 

35 QRLTNTNLLFWAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRRGPHICFDYNA 
TEDTSDCGRGASFPPSLGVLVSLQLLLLLGLPPRPQPQVLVHASRRL 
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21 - human alpha2 delta- 3 nucleotide sequence 

TACTATAGGGCGGCCGCGAATTCGGCACGAGGCGGCGCGGAGCGGAGCAGGCAGCCCCGCG 
CGCTCGCCCACCGCCCGCTCCGCGCAGCTCCCCGCGGCCGCTCTCGTCGCCGCCGCAGCGG 
GCGCGTCGGAGGGAGCCCAGCATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGG 
5 CCTCGGCGCTTCTCGCTGCCGCGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGA 
GCAGCAGATACCGCTCTCCGTGGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAA 
TCCATTGCTGCTAAGTACTCCGGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGA 
AAGACGTTGCCATAGAAGAAATTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACAT 
GGAAGAGATGTTTCACAAGAAGTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAA 

1 0 GCACACCTGAAACATGAATTTGATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGA 
TAAATGAAAGGGACAAAGACGGGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCC 
AAATGACCATTTTAATAATTTGCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACG 
AACATGTACAACAAAGACCCTGCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACA 
AAGTTTTTGTAGATAACTTTGACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGC 

1 5 AAAGGGCTTTTTTAGGCAGTATCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATT 
GCCTTCGACTGCAGGAACCGAAAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGG 
TCATTTTAGTTGACGTCAGTGGCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAAC 
AGTCTCATCCATTTTGGATACACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAAT 
GAGGAGCTTCACTATGTGGAACCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAA 

20 ACAAAGAGCACTTCAGGGAGCATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGA 
TATAGCTCTGAATGAGGCCTTCAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGT 
ATCTGCAGTCAGGCCATCATGCTCATAACTGATGGGGCGGTGGACACCTATGATACAATCT 
TTGCAAAATACAATTGGCCAGATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGA 
GGCTGCGTTTGCAGACAATCTAAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAG 

25 ATCTCCACCTTGGCTGATGTGCAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGC 
C CAAAGTCATCGAC CAGGAGCATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCT 
GACTGATGATCAGGGCCCCGTCCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAG 
AACGAAACCAGATCGAAGGGCATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAG 
AACTTCTGAAGACCATCCCCAAATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCAC 

3 0 AAATAATGGRTATATCCTGACGCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAG 
CGAAGGAAACCTAACTATAGTAGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATG 
ACGTGTTGAGAAATGCTATGGTGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAA 
GACAGTGGACAAAGGGAAACGGGTTTTGGTGATGACAAATGACTACTATTATACAGACATC 
AAGGGTACTCCTTTCAGTTTAGGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCC 

3 5 GAGGGAATGTAACCATCGAAGAAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGC 
AGATGAATGGTCCTACTGCAACACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTA 
GAAGCGATTAAGCTCTACCTAAAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGA 
TCCAAGAAGTCCTTTTTGACGCGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCT 
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GGCCCTCAACAAATCTGAAAATTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGC 
ACGGGCCTCTCCAGAATCAACCTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCC 
TGAAAGCTGGCGACAAGGAGAACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAG 
AGCCGCTGAGCAGATTCCAGGGAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTC 
5 AATAAAAGCAATGTGGTGACAGCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTC 
CTGTGGTGGCAGCTGTAGGCATTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGAC 
TGCCAGCAGACAGTGTGCTTCCCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACT 
GTGAATTGTTACCTCATAGACAATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGA 
CTGGAGACTTTTTTGGTGAGATCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTC 

1 0 CTTTAAAAGAATTACCCTTTATGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGC 
GATGGCGCCCATGGCCTCCTGGATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCA 
TGACAGAACTTGTCTTGTTCCTGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATAT 
GACAGCTAAAGCCCAGAAATTGAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCA 
TTCGTCTCTGAGCGCACCATCAAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCA 

1 5 AGTCCTTTGTCATCCAGCAAATCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAG 
CTGCCTCTGTGAATCTGTGGCCCCCATCACCATGGCACCCATTGAAATCAGGTATAATGAA 
TCCCTTAAGTGTGAACGTCTAAAGGCCCAGAAGATCAGAAGGCGCCCAGAATCTTGTCATG 
GCTTCCATCCTGAGGAGAATGCAAGGGAGTGTGGGGGTGCGCCGAGTCTCCAAGCCCAGAC 
AGTCCTCCTTCTGCTCCCTCTGCTTTTGATGCTCTTCTCAAGGTGACACTGACTGAGATGT 

20 TCTCTTACTGACTGAGATGTTCTCTTGGCATGCTAAATCATGGATAAACTGTGAACCAAAA 
TATGGTGCAACATACGAGACATGAATATAGTCCAACCATCAGCATCTCATCATGATTTTAA 
ACTGTGCGTGATATAAACTCTTAAAGATATGTTGACAAAAAGTTATCTATCATCTTTTTAC 
TTTGCCAGTCATGCAAATGTGAGTTTGCCACATGATAATCACCCTTCATCAGAAATGGGAC 
CGCAAGTGGTAGGCAGTGTCCCTTCTGCTTGAAACCTATTGAAACCAATTTAAAACTGTGT 

25 ACTTTTTAAATAAAGTATATTAAAATCATAAAAAAAAAAAAAAAAAAAA 

22 - human alpha2 delta -3 amino acid sequence 

MAGPGS PRRAS RGAS ALLAAALL YAALGDWRS EQQ I PLS WKLWAS AFGGE I KS I AAKYS 
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF 

30 DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFMNLPWISLSDVQVPTNMYNKDP 
AIVNGVYWS ES LNKVFVDNFDRDPS L IWQYFGS AKGFFRQYPGIKWEPDENGVIAFDCRNR 
KWY I QAATS PKDWI LVDVS GSMKGLRLT I AKQTVS S I LDTLGDDD F FNI I AYNE E LHYVE 
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM 
LITDGAVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV 

35 QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG 
ILLGWGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS 
S VDLS EVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVD KGKRVLVMTND YYYTD I KGTPFS L 
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL 
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KGKEPLLQCDKELIQEVLFDAWSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN 
LFVGAEQLTNQDFLKAGDKENI FNADHFPLWYRRAAEQ I PGS FVYS IPFSTGPVNKSNWT 
ASTSIQLLDERKSPWAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETWCYLID 
NNGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGS FKRITLYDYQAMCRANKESSDGAHGLL 
5 DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI 
KETTGNIACEDCSKSFVIQQIPSSNLFMWVDSSCLCESVAPITMAPIEIRYNESLKCERL 
KAQKIRRRPESCHGFHPEENARECGGAPSLQAQTVLLLLPLLLMLFSR 

23 - human amino acid sequence 

10 MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 
AS TLRLD F I ED PNFKNKVNYS YAAVQ I PTD I YKGS TVILNELNWTEALENVFMENRRQDPT 
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGAS S PKDMVI IVDVSGSVSGL 

1 5 TLKLMKTSVCEMLDTLSDDDYV1WASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK 
GTTG YKAG F E YAFD Q LQNSN ITRANCNKMIMMFTDGGEDRVQDVF E KYNWPNRTVRVFT F S 
VGQHNYDVTP LQVJMACANKGYYFE I PS IGAIRINTQE YLDVLGRPMVLAGKE AKQVQWTNV 
YED ALGLGL WTGTLPVFNLTQDGPGEKKNQL I LGVMGIDVALND I KRLTPNYTLGANGYV 
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 

20 KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV 
FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAWGVKLDLEAWAEKFKVLASN 
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 

25 MLAL YlSnSTS F YTRKE S YD YQ AAC AP Q P PGNLGAAPRGVFVPTVAD F LNLAWWTS AAAWS L FQ 
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSWASYNAIIDCGNCSRLFHA 
QRLTNTNLLFWAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRRGPHICFDYNA 
TEDTSDCGRGAHHHHHH 

30 24- mouse amino acid sequence 

MAGPGSLCCASRGASALLATALLYAALGDWRSEQQIPLSWKLWASAFGGEIKSIAAKYS 
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKIMEEMFHKKSEAVRRLVEAAEEAHLKHEF 
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFMNLPVNISLSDVQVPTNMYNKDP 
AIVNGVYWS ES LNKVFVDNFDRDP S L I WQ YFGS AKGFFRQ YPG I KWE PDENGVI AFDCRNR 
3 5 KWYIQAATSPKDWILVDVSGSMKGLRLTIAKQTVSS ILDTLGDDDFFNIITYNEELHYVE 
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM 
L I TDGAVDT YDT I FAK YNWPDRKVR I FT YL I GRE AAF ADNLKWMAC ANKGFFTQ I S TL AD V 
QEISTVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLPQAQKLADDQGLVLMTTVAMPVFSKQN 
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ETRSKGILLGWGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRPLYEEGKKR 
RKPNYS S VDLS EVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTND Y Y YTD I K 
GTPFSLGVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLE 
AI KLYLKGKE PLLQCDKELIQEVLFDAWS APIEAYWTS LALNKS ENSDKGVEVAFLGTRT 
5 GLSRINLFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIAGS FVYS IPFSTGTVN 
KSNWTASTSIQLLDERKSPWAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETV 
NCYLIDNNGFILVSEDYTQTGDFFGEVEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSD 
SAHGLLDPYKAFLSAAKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAF 
VSERTIKETTGNIACEDCSKSFVIQQIPSSNLFMVWDSSCLCESVAPITMAPIEIRYNES 
10 LKC E RL KAQ K I RRRPE S CHGFHPE ENARE CGGAS HHHHHH 

25 - nucleotide sequence 

TCGCCACCATGGCGGTGCCGGCTC 

15 26 - nucleotide sequence 

TCGGAATTCCTCAGTGATGGTGATGGTGATGGGCCCCGCGGCCACAGTC 

27 - nucleotide sequence 

TCGCCACCATGGCCGGGCCGGGC 

20 

2 8 - nucleotide sequence 

TCTCAGTGATGGTGATGGTGATGCGATGCACCCCCACACTCTC 
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CLAIMS 

1) A purified or isolated nucleic acid encoding a mammalian secreted soluble cerebral 
5 cortical voltage-dependent calcium channel a 2 5-2, a 2 5-3 or a 2 5-4 subunit polypeptide. 

2) A purified or isolated nucleic acid according to claim 1, comprising a polynucleotide 
having at least 90% identity with the sequence encoding : 

- from amino-acid 1 to between amino-acids 1027 and 1062 of SEQ ID N°20 for a 2 5-2, 
10 - from amino-acid 1 to between amino-acids 984 and 101 9 of SEQ ID N°22 for a 2 8-3. 

3) A purified or isolated nucleic acid according to claim 1, having at least 90% identity 
with the sequence encoding : 

- from amino-acid 1 to between amino-acids 1047 and 1062 of SEQ ID N°20 for a 2 8-2, 
15 - from amino-acid 1 to between amino-acids 1004 and 1019 of SEQ ID N°22 for a 2 8-3. 

4) A purified or isolated nucleotide sequence according to claim 1 wherein said sequence 
is the sequence of SEQ ID N°l, SEQ ID N°2, SEQ ID N°3, SEQ ID N°7, SEQ ID N°8, 
SEQ ID N°9, SEQ ID N°13, SEQ ID N°14, SEQ ID N°15, SEQ ID N°19 or SEQ ID N°21. 

20 

5) A purified or isolated nucleic acid, having at least 90% identity with the nucleotide 
sequence of SEQ ID N° 19 or SEQ ID N°21. 

6) A purified or isolated polynucleotide comprising at least 10 consecutive nucleotides of 
25 the nucleotide sequence of SEQ ID N° 1 9 or SEQ ID N°2 1 . 

7) A polynucleotide probe or primer hybridizing, under stringent conditions, with the with 
the nucleotide sequence of SEQ ID N° 19 or SEQ ID N°21 . 

30 8) A method for the amplification of a nucleic acid encoding a mammalian secreted 
soluble cerebral cortical voltage-dependent calcium channel a 2 5-n subunit polypeptide 
wherein n is 2, 3 or 4, said method comprising the steps of: 
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(a) contacting a test sample suspected of containing the target secreted soluble a.25- 
n subunit nucleic acid, or a sequence complementary thereto, with an amplification 
reaction reagent comprising a pair of amplification primers located on either side of 
the oi25-n subunit nucleic acid region to be amplified, and 

5 (b) optionally, detecting the amplification products. 

9) A kit for the amplification of a nucleic acid encoding a secreted soluble 0128-n subunit 
polypeptide wherein n is 2, 3 or 4, or a complementary sequence thereto in a test sample, 
wherein said kit comprises: 

10 (a) a pair of oligonucleotide primers which can hybridize, under stringent 

conditions, to the secreted soluble ot25-n subunit nucleic acid region to be 
amplified; 

(b) optionally, the reagents necessary for performing the amplification reaction. 
15 10) A recombinant vector comprising a nucleic acid according to claim 1 . 

1 1) A recombinant host cell comprising a nucleic acid according to claim 1. 

12) A method for producing a secreted soluble a 2 5-n subunit wherein n is 2, 3 or 4, and 
20 said method comprises the steps of: 

(a) inserting the nucleic acid encoding the desired a.25-n subunit polypeptide in an 
appropriate vector; 

(b) culturing, in an appropriate culture medium, a host cell previously transformed 
or transfected with the recombinant vector of step (a); 

25 (c) harvesting the culture medium thus obtained or lyse the host cell, for example 

by sonication or osmotic shock; 

(d) separating or purifying, from said culture medium, or from the pellet of the 
resultant host cell lysate, the thus produced a 2 5-n subunit polypeptide of interest. 

30 13) A purified or isolated recombinant polypeptide comprising the amino acid sequence of 
a secreted soluble a 2 8-2, ot25-3 or a 2 8-4 subunit polypeptide. 

14) A recombinant polypeptide according to claim 14, having at least 80% amino-acid 
identity with a polypeptide comprising : 



* 
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- from amino acid 1 to between amino acids 1027 and 1062 of the amino acid sequence 
of SEQIDN o 20, or 

- from amino acid 1 to between amino acids 1019 and 1079 of the amino acid sequence 
of SEQIDN 0 22. 

5 

15) A recombinant polypeptide according to claim 14, wherein said recombinant 
polypeptide is selected from the group consisting of the amino acid sequences of SEQ ID 
n°4, SEQ ID n°5, SEQ ID n°6, SEQ ID n°10, SEQ ID n°ll, SEQ ID n°12, SEQ ID n°16, 
SEQ ID n°17 and SEQ ID n°18. 

10 

16) A method for the screening of ligands which bind a cerebral cortical voltage- 
dependent calcium channel a.25-n subunit wherein n is 2, 3 or 4, said method comprising 
the steps of: 

- contacting a secreted soluble recombinant calcium channel o^S-n subunit 
15 polypeptide with: 

a ligand of interest; and 
- a labelled compound which binds the ot25-n subunit; and 

- measuring the level of binding of the labelled compound to the a,25-n subunit. 



20 17) A method according to claim 16, wherein said method is a scintillation proximity 
assay. 

18) A method according to claim 16, wherein said method is a flashplate assay. 

25 19) A method according to claim 16, wherein said method is a filter binding assay. 

20) A method according to claim 16, wherein said secreted soluble recombinant 
calcium channel ot28-n subunit polypeptide is selected from polypeptides having at least 
80%, preferably 90%, more preferably 95%, and most preferably 98 or 99% amino-acid 
30 identity with the polypeptide comprising from amino acid 1 to between amino-acids 984 
and 1063, preferably between amino-acids 994 and 1054, and most preferably between 
amino-acids 1019 and 1054 of SEQ ID N°5 or SEQ ID N°16. 
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21) A method according to claim 16, wherein said secreted soluble recombinant 
calcium channel a25-n subunit polypeptide is selected from the group consisting of SEQ 
ID N°6, 7, 8, 9, 13, 14 and 15, with the polypeptides of SEQ ID N°9 and SEQ ID N°15 
being most preferred. 

22) A kit for the screening of ligands which bind bind a cerebral cortical voltage- 
dependent calcium channel 0C25-n subunit wherein n is 2, 3 or 4, said kit comprising: 

a secreted soluble recombinant calcium channel a.28-n subunit; and 
a labelled compound which binds to the cC28-n subunit. 
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ABSTRACT 

5 

Secreted soluble ot25-2, 0128-3 or a. 2 5-4 calcium channel subunit polypeptides and 

screening assays using same 

10 

Soluble 0626 sub-type polypeptides. 

Methods for cloning, expression and purification of freely soluble 0C25subtype 
15 polypeptides. 



A method for the screening of ligands which bind to soluble ct25 subtype polypeptides. 



